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Conventions 


Numbers like 1.1.26 refer to the sütras of Astadhyayr. Sanskrit words and sen- 
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mar are in normal font. Terms like sound-sets that are introduced by me for 
the description of the formal framework are in typewriter font. Indicatory 
"jt-markers" appear in brackets, e.g. ti(p). Phonemes for the sake of pronun- 
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Preface 


The present work is a study of the Astadhyayi of Panini from a new perspec- 
tive and is an adapted version of my doctoral dissertation with the same title. 
It attempts to explore the Pàninian grammar from a formal point of view 
and investigate the possibilities of representing it in a logical, explicit and 
consistent manner. Such a representation requires an appropriate framework. 
A formal framework would facilitate adequate tools for postulating and 
evaluating hypotheses about the grammatical system. Moreover, it would 
furnish the basis for a computer implementation of the grammar. Both 
aspects, namely a formal representation and computer implementation of the 
Astadhyayi, are objects of enquiry in the field of theoretical studies on Panini 
as well as the emerging discipline of Sanskrit computational linguistics. This 
book takes on the ground-work in these areas. 


The propositions that I put forward in this book are a result of my ex- 
perimentations with the Astadhyayi. Over the last few years, I tried a 
number of models to comprehend the content and processes of the Pàninian 
system. Beginning with the initial aim to automatize Astadhyayr, I examined 
the various challenges and issues accompanying this and in the process 
graduated to work on the development of a formal framework for the gram- 
mar. The outcome of this ongoing process is summarized in the present work. 


There are several excellent expositions of Astadhyayi by the scholars in 
this field and these are evidently the chief source of my understanding on this 
subject. At the same time, there is hardly any significant work on Panini from 
a formal perspective. The relatively limited writings available are largely in 
the nature of unproven hypotheses with few exemplary comparisons and 
usually with claims to show that Panini has been the forerunner in matters 
of logic, mathematics and computer-science. The present work does not 
strive for any of the above objectives. It is not an attempt to compare the 
Paninian system with modern grammatical theories. Neither is it an attempt 
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to establish Panini as the source of the concepts and methods followed by 
modern computing systems. Neither does it claim that Sanskrit is the most 
suitable language for computers. 


The present study attempts to render the Paninian system of Sanskrit 
grammar in a framework which consists of unambiguous, consistent and 
explicit categories. Only then can it be conveyed to logical systems like 
modern day computers. Differing from the formulation of Astadhyayr, which 
is composed in an artificial yet natural language and is meant to be employed 
by individuals who are acquainted both with the Sanskrit language and 
the techniques of grammar, the present rendering aims for a non verbal 
representation in terms of mathematical categories and logical relations 
which can be implemented in an algorithmic manner. 


The process of formalization, however, involves determination of the 
underlying principles regarding the functioning of grammar. My first 
response was to look into the explanatory literature associated with the 
Astadhyayi and on this basis to decide upon the general principles that may 
lead towards formalization. Although a careful study of the literature is of 
immense importance to enable understanding of the various issues, in my 
case it did not suffice to devise a solution for formalization, based only upon 
comparative studies of Paninian literature. The reason lies primarily in the 
different nature of the task at hand. Later literature on Astadhyayr is primar- 
ily explanatory. Although there are significant attempts to clarify, uphold 
and sometimes rectify the grammatical corpus, there is hardly any effort to 
render it in a new formal setup. This is evident because the Astadhyay is 
(to a significant extent) a formal presentation of the grammar. And a very 
brilliant one indeed.” 


The remarkable success of Astadhyayi had the consequence that the 
main effort of later grammarians was directed towards keeping it intact. 
Apart from the attempts by grammarians like Ramacandra (late 14th-15th 
century) and Bhattoji Diksita (late 16th-17th century) to reorganize the rules 
or sütras of Astadhyayi with process or application (prakriya) as the main 
focus, there are hardly any works dedicated to recasting Astadhyay1? 


1 On the explanatory nature of the literature on Astadhyayr, see (Bhattacarya 1955 p. 123- 
132) and for a bibliographical note (Cardona 1980 p. 278-293). 

? Patafijali considers Panini to be an "embodiment of authority" ATR smmari: | (MB on 
1.1.1) regards "the sütras of Panini as beautiful" aile ag iuri: Spe He: | (MB on 2.3.66) 
and postulates that "just because of doubtful appearance of a rule, it should not be rejected, 
but should be made precise on the basis of reasoned explanation" STENA RaRa 
ESAT. (PB. 68). The Paniniyas or the scholarly followers of Panini formulate a number 
of conventions to keep intact the systematic coherence of Astadhyayr, see (Wujastyk 1993). 
? Bhattoji Diksita's Siddhantakaumudi is a reorganization of the Paninian rules. Compared 
to the earlier attempt of Ramacandra in his Prakriyakaumudi, Bhattoji Diksita sticks to the 
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The question of formalization, in terms of a logical language which a com- 
puter program without any knowledge of Sanskrit could understand, was 
naturally not a requirement at that time either. Formal representational 
techniques that are being evolved today and increasingly being employed to 
develop machines and computers were missing, and it would be anachro- 
nistic and wrong to expect Panini (or anyone) to anticipate the requirements 
and expectations of a computer program trying to implement it two-and- 
a-half millenia later. It should be noted here that I am neither denying the 
formal nature of Astadhyayr, nor examples of precursors of several modern 
techniques in it^ However, what is certain is that Astadhyayr, as we have 
it today, would require considerable additional information organized in a 
suitable manner in order to make it executable as a computer program.? 


As a consequence, a new methodological approach becomes necessary —the 
scientific method of observation, hypothesis and testing in order to develop a 
new formal representation of the Astadhyayi.° The present work is primarily 
based on this methodological approach. Unlike most of the publications on 
Paninian grammar, I do not attempt to comment upon an issue by collecting 
and analyzing the views of the traditional scholars or the Paniniyas. Although 
an important task, this is hardly fruitful in my case. The reason for this is 
that I am confronted with a challenge of our time and the Paninian tradition, 
because of its antiquity, had no occasion to comment on it. On the other hand, 
my investigations are directly based on the corpus of Astadhyayi. My use of 
later Paniniyas is in order to understand the tradition and to relate them to 
important premises of my hypotheses. 


It could be asked here: how justified I am in proposing systematic frame- 
works that are not directly supported and employed by the Paniniyas? I feel 
that Astadhyayt is a text which not only allows but invites such an approach. 
Its composition is the result of an empirical observation and systematic 
organization of the linguistic features. It is an appropriate case for studying 
the methods of comprehending linguistic phenomena through developing 
systematic structures. 


Another ground for formalization is that it has both theoretical and practical 
relevance in the field of Paninian studies. Practically speaking, it facilitates 
better access to the content and processes of the Pàninian system, not just to 
experts in this field, but to non-Sanskritists as well. Theoretically speaking, it 


Paninian rules only. He also covers all the rules of the Astadhyayi. See (Cardona 1980 p. 285- 
288) for bibliographical notes and (Houben 2008 p. 563-574) for the reasons for the tremen- 
dous success of Bhattoji Diksita. 


4 See section 1.1 for recent works on the formal character of Paninian rules. 
5 | will discuss this in section 3.2. 
6 On the scientific approach, see (Wilson 1952) and (Popper 1959). 
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prompts us to look at the Astadhyayi from a formal perspective. This entails a 
critical examination of the content and processes of the grammatical system. 
Strategies for evolving such a representation involve a reworking of the oral 
framework in which Astadhyayi is composed. This leads to identification, 
analysis and determination of issues related to organization and application 
of Paninian rules. A formal representation also facilitates precise formulation 
and testing of hypotheses regarding some of the fundamental issues of the 
Paninian system—meta-linguistic conventions (paribhasas), brevity (laghava), 
rule organization (adhikara and anuvrtti), ordering and application of gram- 
mar (prakriya), functioning of the system etc. In short, it opens up a new 
paradigm for Paninian studies. 
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Chapter 1 


Introduction 


The study of Astadhyayi can be classified into three broad areas of academic 
research: 


1. Analysis of the grammatical corpus in order to understand its organiza- 
tion and functioning, 


2. formalization of the grammatical system, and 
3. its computer implementation or automation! 


The present work deals with the latter two areas, namely, formalization and 
computer implementation of Astadhyayr. It seeks to study the content and 
processes of the Paninian system of Sanskrit grammar and re-present them in 
terms of logical relations and operations. A formal representation is attempted 
in order to facilitate an examination of the underlying grammatical structures. 
It also enables an implementation of the grammatical processes on computer. 


1.1 Earlier research 


In the past few decades there has been an increased interest in studying the 
Astadhyayi from a formal perspective. Scholars like Vidya Niwas Misra (1964, 
1966), M. D. Pandit (1966, 1974), Frits Staal (1965, 1966) and Paul Kiparsky 


1 According to Frits Staal (1966 p. 209): "If we distinguish three stages in the study of Astad- 
hyayi as a generative device, it may be held that the first stage, that of analysis, has been 
dealt with successfully by Indian commentators since Patafijali and by Western scholars of 
the last two centuries; however, this task is by no means completed. The second stage, that 
of formalization, has perhaps just begun to receive attention; it depends on analysis, but is 
not determined by it. The third stage, that of automation, itself depending on formalization, 
is not determined by formalization; it may not even be effectively realizable." 
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(1969) published research papers and monographs showing that certain for- 
mal features of mathematics and modern linguistic theories like context- 
sensitive rules or elements of generative transformational grammar are al- 
ready present in it. These studies further supported the initial fascination for 
the Astadhyayi as "one of the greatest monuments of human intelligence" 
and "an indispensable model for the description of languages" (Bloomfield 
1929 p. 268). In the year 1985 Rick Briggs, a NASA scientist, published a pa- 
per on "Knowledge Representation in Sanskrit and Artificial Intelligence" in 
which he compared the system of karakas with representational techniques in 
Artificial Intelligence and posited that: 


Among the accomplishments of the grammarians can be reckoned a method for para- 
phrasing Sanskrit in a manner that is identical not only in essence but in form with 
current work in Artificial Intelligence? 


This statement is illustrative of the aim of extracting the techniques of 
representation in the Paninian grammar that can be fruitfully employed 
for computational processing. Following this, a "National Conference on 
Knowledge Representation and Inference in Sanskrit" was organized in 
Bangalore in December 1986, “to extract this hidden ‘algorithm’ of automatic 
semantic parsing from the Sanskrit pandits” (Briggs 1987 p. 99). A group of 
scholars from the Indian Institute of Technology, Kanpur undertook projects 
incorporating Pàninian perspectives, especially the kZzraka-system, with 
modern techniques of Natural Language Processing (NLP). Their aim was to 
develop a machine translation tool for English and Hindi based on insights 
gained by the Astadhyayi of Panini (Bharati 1994). The work initiated is 
followed by the “AnusAraka” Language Resource Development project. This 
is still an ongoing project, and once completed, should “allow users to access 
text in any Indian language, after translation from the source language (i.e. 


English or any other regional Indian language)" ? 


The nature of the above efforts has been to utilize some of the insights 
from the Pàninian grammar and apply them to the standard techniques of 
Natural Language Processing. The next step comes from researchers working 
in the field of computational linguistics. A general opinion which is often 
articulated here is that Sanskrit is one of the most suitable languages for 
computers. This is normally grounded on the assumption that it is a well- 
structured language which in turn is justified on the basis of the algebraic 
rules of its grammar. Some scholars opine that Sanskrit, being a perfect 
language, with a grammar like Astadhyayr, comes closer to a computer 


? Briggs’ analysis is based on the Vaiyakaranasiddhantamafijisa of Nagesa Bhatta (1730- 
1810 C.E.). See (Briggs 1985 p. 32-34). 

3 The partner institutions of this project are: Chinmaya International Foundation (CIF) 
Shodha Sansthan, Kerala; Language Technologies Research Centre, IIT Hyderabad; Depart- 
ment of Sanskrit Studies, Hyderabad University. For more information, see the project web- 
Site: http: //ltrc.iiit.ac.in/-anusaaraka/ (accessed on 24.10.2015). 
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language and in future even computer programs could be written in Sanskrit. 
A summary of the approaches followed by the ongoing research projects on 
computerization of the Astadhyayr, however, shows that as yet there are no 
finished automated systems or programs that implement the whole corpus 
of Astadhyayi. 


A first effort in this regard is the creation of an electronic version of the 
corpus of Astadhyàyr which was prepared by Dr. Shivamurthy Swamiji 
of Sri Taralabalu Jagadguru Brihanmath, Sirigere, Karnataka. He calls it 
Ganakastadhyayi meaning “computer software on Astadhyayi".^ It contains 
the Sütrapatha, Padapatha, anuvrttis, vrtti from Siddhantakaumudi and 
Laghusiddhàntakaumudi (incomplete), French translation by Louis Renou 
as well as inflectional tables for nominal and verbal stems, including step-by- 
step analysis of Panini's sütras, applied to produce different forms of nominal 
stems. Shivamurthy Swamiji is also developing a rule based application of 
the Paninian derivational process." 


Another database of examples (udaharana) found in the four major commen- 
taries of the Astadhyayi—namely the Mahabhasya, Kasikavrtti, Bhasavrtti 
and Siddhantakaumudi—is prepared by the French Institute of Pondicherry. 
These are published in printed form as well as CD-ROM version including 
books on collection of examples (udaharana-samahara), on compounds (samasa 
prakaranam) and on verb inflections (tirianta prakaranam) (Grimal 2005, 2006, 
2006a and 2010). 


A digital edition of the Astadhyayi is being prepared by Wiebke Pe- 
tersen under the project: “Pratyaharas or features? A qualitative analysis of 
phonological descriptive techniques—a comparison of Panini’s pratyaharas 
and phonological features". Attempts to implement the content and pro- 
cesses of the Astadhyayi are relatively recent and only a few in number. Most 
of them base themselves upon the research and publications in the area of 
formalization of the Pàninian grammar. It is imperative, therefore, to first 
look into the outcome of the investigations in this field. 


4 Information accessed from http: //www.taralabalu.org/panini/ on 02.11.2015. 

5 During his visit to Heidelberg on 17.05.2013 he showed me his application for the de- 
clension of nominal stems which he hopes to finish in near future. He told me that he is 
attempting to follow the exact process of Astadhyayi, although I did not had the oppor- 
tunity to look into the program codes. Thus far, there is no publication on the manner in 
which it is implemented. 

$ For more information about the ongoing project, see the project website (accessed 
on 09.11.2015): http://www.ifpindia.org/Paninian-Grammar-through-its-Examples. 
html 

7 More information at the website: http://panini.phil.hhu.de/?section-home (ac- 
cessed: 22.02.2016). 


10 1 Introduction 


Apart from a few early publications that explored the mathematical as- 
pects of Panini? the tone of the research towards formalization of the 
Astadhyayi was set by developments in the generative grammar approach of 
Noam Chomsky in late fifties and early sixties. Chomsky declared Panini’s 
grammar to be the first and earliest version of a generative grammar? 
The idea of a formal grammar of language that can generate an infinite 
number of utterances with a limited set of grammatical rules and a finite set 
of terms, evoked a close parallel with the Astadhyayi. Accordingly, some 
Paninian experts published papers with the prime aim of comparing and 
showing the presence of Chomskyan findings in the grammatical system of 
Panini. In a paper written in 1965, Murray Fowler attempted to test whether 
Panini’s rules can be ordered in a manner so that they can be implemented 
through a Finite State Automaton (Fowler 1965 p. 44-47). This corresponds 
to the Type-3 or regular grammars in the Chomsky hierarchy.'° Frits Staal 
promptly corrected this assumption in a brief communication in 1966, and 
showed that the way Panini's rules are conceived and organized, it would 
not be possible to equate them with a regular grammar (Staal 1966 p. 206-209). 


Staal further showed parallels with the Type-1 or context-sensitive gram- 
mars and certain phonetic rules for replacements of sounds in the Pàninian 
grammar. In the year 1965, he published an article on the "Context-sensitive 
rules in Panini" (Staal 1965). He selected rules from the sixth chapter of the 
Astadhyayi, mainly from 6.1.71 to 6.1.109. The rule iko yanaci!! is a typical 
example. He took the terminology from Chomsky" and described this 
phenomenon using the representation 


a[b > c]d 


8 These include publications by Misra (1964 p. 157-178) and Pandit (1966, 1974 p. 179- 
192). These are, however, mostly of the nature of detecting some mathematical similarities. 
Among the early publications are the articles of Klaus Mylius (1980 p. 233-248) on the appli- 
cation of mathematical methods in the Vedic research which discusses mostly the statistical 
methods as also Madhav Deshpande (1992 p. 15-27) comparing the Paninian features with 
developments in computational linguistics. 

? The classical work of Chomsky in this regard is his book “Syntactic structures" (Chomsky 
1957). Chomsky e.g. speaking at the Asiatic Society of India, Kolkata on 22.11.2001 tells that 
"the first generative grammar in the modern sense was Panini's grammar" (Chattopadhyay 
2001 p. 18). 

10 The grammars of a formal language are put in a hierarchy called the Chomsky hierar- 
chy. The Type-0 corresponds to unrestricted grammars, Type-1 to context-sensitive gram- 
mars, Type-2 to context-free grammars and Type-3 to regular grammars. Chomsky hierar- 
chy plays an important role in the area of formal languages which have special application 
in computer science, see (Chomsky 1956 p. 113-124, 1959 p. 137-167 ; Chomsky and Shützen- 
berger 1963 p. 118-161). 

11 gA aA 8.2.09 ll > ik is replaced by yan if it is followed by ac. 

12 Staal provides the reference: (Chomsky 1963 p. 294). 
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where a is left context, d is right context, b is sthanin or to be replaced and 
c is ddega or replacement. In other words b is replaced by c.?^ He showed 
that Paninian meta-language can even represent the process of substitution 
for more than one phoneme in a collective manner. This is demonstrated by 
the convention for respective correspondence of two lists of equal cardinality 
stated by the rule: yathasamkhyamanudesah samanam'*. This would be equiva- 
lent to the following representation: 


a[bi ...bn > €... cnld 


Here, b; is replaced by c;, bz is replaced by c» etc. Further, he notes that if the 
contexts remain the same, then they need not be repeated every time and the 
idea of anuvrtti (carrying over to subsequent rules) is applied in the grammat- 
ical corpus to present them in a more succint manner. Thus, 


ailbı > ci]di 
ailb2 — c2]d2 


can be represented in a more concise manner as follows: 


ailbı > ci]di 
[be — c]d» 


Staal extended the comparison beyond the phonetic rules and published 
a paper in which he showed that the methods of generative grammar are 
similar to the syntax of nominal compounds in Sanskrit (Staal 1966a p. 198). 


The comparison and motivation from the generative grammar was ex- 
tended to the syntactic and semantic relations in Panini in a paper published 
jointly by Paul Kiparsky and Frits Staal in 1969. In this paper, the authors 
proposed that Panini’s grammar is a system of rules for converting semantic 
representations of sentences (concepts like "agent", "goal", "location") into 
phonetic representations (case endings, verbal affixes etc.). This is achieved 
via two intermediate levels which may be respectively compared with the 
levels of deep (underlying) structure and surface structure in a generative 
grammar. The deep level corresponds to the level of karaka-relations such 
as "(underlying) subject", "(underlying) object" and the surface level repre- 
sents morphological categories like nominal cases, derivational affixes etc. 
(Kiparsky and Staal 1969 p. 84). While carrying out the comparisons with the 
generative grammar, they pointed out that there are essential differences as 
well, especially in the manner in which rules are ordered and organized in 
the Astadhyayi and the way constituent structures are used (Kiparsky and 
Staal 1969 p. 105-106). 


13 The use of arrow here is different than the general notation, according to which b > c 
means: c is replaced by b. 


14 ERRORI: TAA ll 2.2.20 ll > respective assignment for equal number of elements. 
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The authors successively worked-out and improved this model, and the 
actual version is stated by Kiparsky in a paper published in 2009 (Kiparsky 
2009 p. 35-37).!° 


The generative approach started by Staal's comparison of context-sensitive 
rules and extended and developed by other scholars like Kiparsky—and 
to some extent acknowledged by experts like Johannes Bronkhorst (1979 
p. 146-157), S. D. Joshi and J. A. F. Roodbergen'é—had a far reaching 
impact on the attempts by later scholars aiming for computerization of the 
Astadhyayi. This will be evident from the following summary of these efforts. 


In the year 1993, Saroja Bhate and Subhash Kak published an article on 
“Panini’s grammar and computer science". They defined a Paninian rule as 
follows: 
A Ps [Panini sūtra] is a single clause proposition consisting of a subject, a predicate, 
and an environment. It is a statement about grammatical features such as a suffix, an 


augment, a substitute, accent, reduplication, elision, and compounding. It is usually 
of the form A is B in the environment C. This can be written in the following formula: 


Ps: A — B (C) 


Here — stands for is or becomes, and ( ) stands for when, A stands for the subject, B 
represents predicate, and C stands for environment. While A and B are the necessary 
components of a sūtra, C is optional (Bhate and Kak 1993 p. 5). 


According to the authors, the three categories A, B and C can be either a single 
member or multiple member categories or a combination of both. An example 
of one member category which they provide is the rule: iko yanaci"". It can be 
represented by 


A — B (C) 
A multi-member category for A is the rule: karmanyan'®. This is noted as: 


Al-^ — B (C) 


15 For a history of development of this model and critical review, see (Houben 1999 p. 41-46). 


16 Note the following remarks: "Since it reproduces standard speech, the A. [Astadhyayi] 
is a prescriptive grammar. It states the rules which must be applied, if the speaker wants 
to convey meaning in a grammatically correct form. It is also a generative grammar, in two 
senses. First in this (Chomskyan) sense that in the process of derivation the wordform is 
fully described. Secondly, in the sense that, with the help of a limited number of rules (about 
4000), and with the help of the dhatupatha and ganapatha, which provide the basic lexical 
elements, the A. is able to produce an infinite number of words, and thus, an infinite number 
of sentences." (Joshi and Roodbergen 1991 p. 15-16). Joshi uses the term generative grammar 
taken from the Chomskyan context, but at the same time he clarifies that it be used in a 
slightly different manner in Paninian system (Joshi 1968 p. ix.fn.22). See also (Joshi and 
Roodbergen 1980 p. vi-xv). 


17 Sal aR 6.2.99 > ik is replaced by yan if it is followed by ac. 


18 PHOT 13,3. ll > ifa pada functioning as karman occurs together with a dhatu then suffix 
a(n) is introduced after the dhatu. 
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The nature of this formulation by Bhate and Kak is close to the context- 
sensitive rules mentioned earlier. The main contention of this representation, 
however, is a general one. It says that in the grammatical process a given 
element A attains a particular identity or is transformed to some other form B. 
This happens when there is some suitable condition (C). It does not take into 
account the details of the derivational process nor does it provide a practical 
framework to apply the rules. Moreover, it does not account for instances 
where the derivational history or earlier stages provide the conditions for 
some operation. Nor does it specify the different kinds of operations that 
are needed for the process of synthesis. Apart from a few examples to show 
the formal nature of some of the rules of Astadhyayi, it fails to develop a 
workable model of the Paninian processes. 


In his article on the context-sensitive nature of Paninian rules Staal clearly 
notes that this is only the case with a limited number of rules. To quote him: 


In the following we shall be concerned with some rules of Sanskrit grammar as de- 
scribed by Panini, which are context-sensitive. It is neither suggested that such rules 
suffice for the description of Sanskrit grammar, nor that Panini thought so (Staal 1965 
p. 63-64). 


Despite the cautious note of Staal, Bhate and Kak suggest the proximity of 
computer programs and Paninian grammar, primarily on the basis of such 
rules. The following quotation by them is illustrative of this hypothesis: 


The rules [of Astadhyayi] are of different kinds: some are universal and context- 
sensitive transformations, others operate sequentially or recursively. Generally these 
rules are expressed in three groups: (i) rules of interpretation or meta-rules-safíjfi? and 
paribhasa rules, (ii) rules of affixation-rules prescribing affixes after two kinds of ba- 
sic dhatu and pratipadika roots, and (iii) rules of transformation for the stems and the 
suffixes-the morpho-phonemic rules. Note that a computer program has exactly the 
same general features of context-sensitive rules, recursion, and sequential rule ap- 
plication. It is not surprising, therefore, that these sitras have been compared to a 
computer program that generates Sanskrit sentences. Panini's grammar is algebraic 
where a finite set of rules generates an infinite number of words and sentences (Bhate 
and Kak 1993 p. 2). 


They do not show the recursive nature of Paninian rules, and the context- 
sensitive character, mentioned by them above, is not what Staal demonstrates 
for some phonemic substitutions only.'? 


Taking the above clue, Sridhar Subbanna and Shrinivasa Varakhedi, in 
their paper on the computational structure of the Astadhyayi, mention that 
“T]he structure [of Astadhyayi] consists of definitions, rules, and meta-rules 
that are context-sensitive and operate in sequence or recursively (Subbanna 


19 To substantiate their claims, they further point out the principles of numerical correspon- 
dence 1.3.10, the idea of ellipsis (anuvrtti), code-letters (anubandha) and the law of general 
and exceptional rules (utsarga and apavada). 


14 1 Introduction 


and Varakhedi 2009 p. 56)" ?? Following the same note, Pawan Goyal, Amba 
Kulkarni and Laxmidhar Behera posit the context-sensitive nature of vidhi 
rules. To quote them: "It has been already recognized that Panini expresses 
all such rules as context sensitive rules (Goyal, Kulkarni and Behera 2009 
p. 144,153)" 21 


The claims of Peter Scharf and Malcolm D. Hyman about the XML and 
Pearl scripts they wrote for sandhi, nominal and verbal inflections, are modest 
in comparison to the above examples (Scharf 2009 p. 117-125). The authors 
note that "[W]e look forward to utilizing the enriched framework in a 
revised, faithful model of Paninian declension. We are currently enriching 
the XML tagset further to allow derivation of participle stems and hope 
to go on to implement derivational morphology generally (Scharf 2009 
p. 125)" Hyman introduces an XML vocabulary for expressing Panini's 
sandhi rules (Hyman 2009 p. 253-265). XML, however, is again a framework to 
implement context-free grammars, which sometimes in their later variations, 
like XML-Schema, can be extended to represent context-sensitive rules 
(DeRose 1997 p. 105-106,139-142). The framework and the corresponding 
data structures, therefore, fall short of the potential to implement the rules 
where the conditions are formulated in a more complex manner than the 
immediate left or right contexts. 


To conclude, scholars like Staal took inspiration from the generative 
grammar approach of Chomsky and tried to show that some of the rules of 
the Astadhyayi correspond to the Chomsky hierarchy. The fact that gram- 
mars listed in the Chomsky hierarchy are suitable for computer languages, 
prompted some to hypothesize that the entire grammatical process can be 
written like a computer program. The above review shows that the recent 
attempts to computerize the Astadhyayi emphasized the context-sensitive 
nature of Paninian rules. There is, however, no study which establishes it to 
be sufficient for implementation of the whole of the Astadhyayi on computer. 
Pànini's work with a formal structure that "can be easily adapted so as to 
perform numerical processing" (Bhate and Kak 1993 p. 2) is still waiting for 
computer implementation. In this regard, the following remark of J. E. M. 
Houben made some years ago, is still pertinent: 
Since at least twenty years there have been ideas to develop "programs replicating 
Paninian prakriya" and programs that analyse "strings in terms of Paninian rules" 
(cp. Cardona 1999 : 272f). Inspite of several elaborate and sophisticated attempts in 
this direction, it seems we are still far from a comprehensive and convincing en- 
dresult. Why is it proving so difficult, for at least some twenty years, to computerize 


20 Kiparsky (2002) seems to be misquoted here by Subbanna and Varakhedi. The statement 
is from Bhate and Kak (1998 p. 2). 

21 By “all such rules" is meant here rules for “assigning a name, substitution, insertion, 
deletion". They refer to the previous study of Bhate and Kak (1993). 
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Panini’s grammar? Perhaps a major reason is that we are not clear on some crucial 
issues regarding Panini’s grammar (Houben 2009 p. 18). 


The above scepticism is shared by other experts of Paninian grammar. During 
his keynote address at the "Third International Symposium on Sanskrit Com- 
putational Linguistics" at Hyderabad in 2009, S. D. Joshi made the following 
remark: 


Reading statements about information coding in which Panini is hailed as an early 
language code information scientist, I am reminded of the situation in the early six- 
ties, after Chomsky had published his book on Syntactic Structures in 1957. Here 
Chomsky introduced a type of grammar called transformational generative gram- 
mar. It earned him a great of applause, globally, I may say. Then it dawned on lin- 
guists that Panini had also composed a generative grammar. So Panini was hailed as 
the fore-runner of generative grammar. That earned him a lot of interest among lin- 
guists. Many linguists, foreign as well as Indian, joined the bandwagon, and posed as 
experts in Paninian grammar on Chomskyan terms. Somewhat later, after Chomsky 
had drastically revised his ideas, and after the enthusiasm for Chomsky had sub- 
sided, it became clear that the idea of transformation is alien to Panini, and that the 
Astadhyayi is not a generative grammar in the Chomskyan sense. Now a new type of 
linguistics has come up, called Sanskrit Computational Linguistics with three capital 
letters. Although Chomsky is out, Panini is still there, ready to be acclaimed as the 
fore-runner of Sanskrit Computational Linguistics (Joshi 2009 p. 1). 


It should be noted here that some scholars have expressed their disagree- 
ment regarding the very possibility of computer automation of the Astad- 
hyayi. Thus Frits Staal conjenctured in the year 1966: “The third stage, that 
of automation, ...may not even be effectively realizable" (Staal 1966 p. 209). 
Hartmut Scharfe, notes recently some four decades after Staal: 

We have to reject, I believe, the idea that Panini's grammar is, as it were, a machine 


that produces correct Sanskrit words and sentences, if only we apply its rules in con- 
formity with established meta-rules of application (Scharfe 2009 p. 85). 


1.2 Nature and scope of the present work 


It is in the context of the above mentioned scepticism regarding the formal- 
ization and computerization of Astadhyayi that the present study assumes 
its relevance. It must be mentioned at the outset that this study does not 
intend to identify or establish the presence of features of modern linguistics 
and computational linguistics in the Paninian system of Sanskrit grammar. 
Neither does it seek to show that Panini anticipates several modern ap- 
proaches or that his Astadhyayi can be considered as the "first computer". 
The main aim of the present effort is to explore the possibilities of a formal 
representation of the content and processes of the Astadhyayi and to enquire 
into the potential of its computer implementation. 
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The first question that arises is whether the text corpus of Astadhyayt 
is formal enough to allow direct computer implementation? In other words, 
would it be possible to write a program whose input is the text corpus of the 
Astadhyayi, and whose output is, a representation, which a computer can 
interpret and apply. I put forward this question as the first hypothesis and 
call it the strong version of the formalization hypothesis. 


If the above hypothesis is true, then it would imply that what one needs 
to undertake is to follow the Astadhyayr in its text and spirit and devise 
programs to implement it in toto. And since the Astadhyayr is interpreted 
and applied with the help of other later texts—like Siddhantakaumudi 
for applicational considerations or Paribhasenduóekhara for meta-rules 
etc.—these can be taken into account for the purpose of computerization. In 
other words, the task would be to accurately simulate the traditional manner 
of grammatical representations and applicational procedures. Most of the 
ongoing projects on computerization of the Astadhyàyi work along these 
lines. In approaching the task of computerization in this manner, they attest 
to a tacit confidence in the feasibility of this hypothesis. 


Such an approach undertaken by several scholars is understandable. 
After all, there is a well established tradition of Panini and the Paniniyas 
spanning over two and a half millenia which is a glaring testimony to its 
efficacy. This further substantiates the opinion that there is absolutely no 
scope for any kind of tampering with the established way of reading and 
applying it. This is also in accordance with the principle set by Patafijali in 
his Mahabhasya where he cautions against any such attempt.”* 


After examining the grammatical corpus my conclusion is that it would 
not be possible to write a computer program that can directly process the 
present corpus of the Astadhyayi. This, however, does not imply that the 
Astadhyayi completely lacks formal components. Pànini's work is an exem- 
plary attempt to formulate the grammar in a formal manner. Research in the 
last few decades has adequately established this fact.? The point here is not 
whether or not the Astadhyayi has formal components, but whether these 
are sufficient for a direct implementation on computer. 


There are several challenges which I discuss in section 3.2. The main ar- 
gument against such an approach is that it would entail considering the 
Astadhyayi as a closed, complete and perfect device. This, however, is not 
the case. For example, we do not have any precise information regarding 
the rule boundaries in several cases. Moreover, there are later additions and 


?? See e.g. (PB. 158). 


233 See here specially the following publications: (Staal 1965, 1965a, 1975), (Petersen 2008) 
and (Kiparsky 2009). 
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emendations suggested e.g. by Katyayana. A closer look at the grammatical 
corpus suggests that it is judicious to consider it as an open, flexible and 
growing network of grammatical content and processes, based on some 
fundamental systematic methodology, which can accommodate additional 
information if it is needed for precise specification. 


Another aspect that argues against the above hypothesis, is that the corpus of 
Astadhyayi is composed with organizational optimality (brevity or laghava) 
as its main goal. The application of grammar for a particular process is not 
explicitly mentioned and is left largely to the person using it. This, however, 
needs to be specified in an explicit manner in case computer implementation 
of the derivational process is desired. 


At this point it becomes important to clarify the nature of the main task 
of this research. Formalization and computerization involve representing the 
content and processes of the Astadhyayi in a new medium. The formula- 
tion of the corpus of Astadhyayi is in Sanskrit with special meta-linguistic 
conventions. Moreover, it is meant for application by individuals who, after 
understanding and remembering its techniques, apply it for derivational 
procedures. Both these aspects—i.e. the oral framework and application by 
human individuals—change in case of a formal representation and computer 
implementation. Thus, while the content and processes remain the same, the 
manner and the medium in which these are comprehended or represented is 
different and the applicational agency is now not the learned human students 
of grammar, but computers or logical systems. 


Acknowledging that the strong version of formalization hypothesis is 
not feasible and the current formulation of the Astadhyayi was meant for oral 
transmission and application by human scholars, the next task is to explore 
the other options. One of the main aims of this study is to enquire into the 
questions: Does Astadhyayi function in an algorithmic manner? If yes, then 
what is the nature of this algorithm? What approach may one take to make 
the algorithmic character of the Astadhyayt explicit? 


The first reaction— prevalent among several modern scholars of Astad- 
hyayi as well—is that it is an example par excellence of a perfect algorithm 
for generating standard Sanskrit expressions. The reason provided by them 
is that Astadhyàyt: employs a highly developed meta-language that clearly 
specifies the rules of grammar and that linguistic expressions can be gener- 
ated by applying these rules. Moreover, there is a well established tradition 
of grammarians—the Paniniyas—contributing to the understanding of the 
Astadhyayi. 


Following this viewpoint, one may assume that Astadhyàyi consists of 
fixed structures that are represented in a consistent manner. These structures 
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comprehend and explain the Sanskrit language. Further, the nature of these 
structures is algorithmic and is guided by the rules and meta-rules of Astad- 
hyayi. Any Sanskrit expression can be derived by following an algorithmic 
procedure. It would involve applying the relevant rules one after another. 
One may implement this task through a computer program which would 
involve telling the computer what to do next. The program would judge 
whether a particular rule is applicable or not and execute it accordingly. The 
task here is primarily of a technical nature. 


The second stage is that such a program could—to a large extent—interpret 
and decide, what is to be done next. This is a more demanding task. The 
underlying assumption is that there is some principled system that guides 
the dynamics of the derivational process, some inherent order based upon 
which derivational stages can be interpreted and right decisions can be made. 
At this point it becomes important to ask about the manner in which the 
grammatical corpus is articulated and the way in which the derivational 
process is executed. How far the tasks which are formulated in a special- 
ized Sanskrit and are designed and meant for human application can be 
transferred to the computers? In which manner? And what is the way to do it? 


Several issues are involved here. Whether the manner in which the grammat- 
ical system is articulated in Astadhyayr is feasible and suitable for computers 
or not. Whether the manual application of a grammatical system for the 
purpose of deriving linguistic expressions can be simulated or not. How 
much and in which manner the task of decision-making can be invested 
in a non-human logical apparatus? In other words, what kind of model 
for the Paninian system is most suitable for the purpose of its computer 
implementation? 


Given the opposing views between some experts of the Paninian system and 
the researchers attempting to automate the Astadhyayi (see section 1.1), I 
intend to approach the task of formalization and computer implementation 
differently. Instead of attempting to automate the Astadhyayi directly, I 
suggest first looking into the underlying systematic approach on the basis 
of which grammar is constructed. The systematic approach is to be gleaned 
by examining the descriptive methodology of ancillary disciplines.” Apart 
from the fact that they are associated with the Vedas, there is an evident 
commonality of approach with respect to their goals and methods. The prime 
effort of the ancillary disciplines is retention of a given phenomenon. For the 
Srautasütras this phenomenon may be a given sequence of ritual actions. The 
Pratisakhyas, as well as Siksa and Chandas texts, aim towards retention of a 


24 The ancillary disciplines or the Vedangas (lit. limbs of the Vedas) are: Siksa (phonetics), 
Chandas (prosody), Vyakarana (grammar), Nirukta (etymology), Kalpa (instructions on ritual 
practice) and Jyotisa (astronomy). See: (Gonda 1975 p. 34). 
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specific collection of Vedic recitations. The Sulbasiitras provide retention of 
the plans and designs of the ritual arena etc. Similarly, the Astadhyayi seeks 
to retain the standard usage of Sanskrit expressions. 


For the purpose of retention of some given phenomenon, the ancillary 
disciplines follow a systematic method, which again is common to all of 
them. In order to substantiate the proposition that there does exist such an 
underlying general system of description of a given phenomenon, which 
permeates across the ancillary disciplines, I have worked-out a few detailed 
examples in appendix A.1. 


Seen from the systematic point of view, grammar also follows the same 
goals and methods. There is no fundamental difference between Panini's 
system and the systems developed in other ancillary disciplines. This also 
explains why technical terms from other disciplines could be easily borrowed 
and utilized in the grammar. Moreover, amendments and extensions in the 
grammatical corpus indicate the presence of an underlying system which 
facilitates flexibility and portability. 


Although apparently not so spectacular the proposition that there is a 
common underlying system across the ancillary disciplines has significant 
consequences for formalization of the Astadhyayi. The grammatical corpus 
can now be considered as a presentation of this general system in a par- 
ticular framework. It is the framework which specifies how components 
are enunciated in the corpus, how are they characterized and combined. 
Further, how the entire mechanism is organized and applied, as well as how 
it is communicated is also dependent on the framework. Thus, I propose to 
distinguish between the general system of grammar and the framework in 
which it is presented. The corpus of Astadhyayr, one can now assert, is the 
general system presented in a special framework. One may call this special 
framework the Paninian framework. 


The strong version of formalization hypothesis can now be reformu- 
lated as follows: the Paninian framework is not sufficiently adequate for 
formalization. It does not negate or ignore the fact that the Paninian frame- 
work is a wonderful example of a major effort to present the general system 
in a formal manner. Yet it is not adequate for a computer implementation 


235 Examples include anusvara, sasthi, saptami etc. There are some 50 terms which Panini uses 
without defining them. For a list, see (Subrahmanyam 1999 p. 109-163). 


26 One may select a consistent and adequate core that can be formalized, akin to S. D. Joshi's 
proposal to consider the systematic consistent portions to be the original core and the in- 
compatible parts as later additions. See: (Joshi and Bhate 1984 p. 252-253). If one is adamant 
to sift out parts of Astadhyayi that would conform to a formal representation, then it is an- 
other matter. In that case, however, to identify these portions, one would eventually require 
such a formal framework! 
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The task which lies ahead is to evolve a new formal framework in which the 
contents and processes of the Astadhyayi can be represented. 


Here, I put forward and intend to establish, another weak version of the 
formalization hypothesis. Its main propositions can be stated as follows: the 
grammatical system followed by the Astadhyayi is an outcome of a com- 
mon systematic approach followed by the ancillary disciplines (Vedanga) 
associated with the Vedic corpus. The functioning of the general grammat- 
ical system can be represented in a formal manner. For this a new formal 
framework would be required. Paninian content and processes can then be 
re-presented in this framework. The new framework, being formal in nature, 
can also be implemented on computer. 


The tasks mentioned above are organized in the following manner. 


System 
Paninian framework Formal framework 
Astadhyayi Formal representation 
? 6” 
Ld 
Ld 
Ld 
Ld 
k 
Computer implementation Computer implementation 


At the top of the diagram is the general system which I outline in chapter 2. 
This, I propose, constitutes the core methodology of the grammatical pro- 
cesses. It is not grammar but the basic methodological system on which the 
grammar is specified. 


The Astadhyàyi of Panini can now be seen as a formulation of this sys- 
tem in a particular framework, which I call the Paninian framework. The 
nature and characteristics of this framework are known to us from extensive 
scholarly research on the Astadhyayi and is not the main focus of the present 
study. In sec. 3.2 some of the problems are mentioned which one would face 
if one were to attempt a direct implementation of the corpus of Astadhyayi. 
In the above diagram it is noted by a dashed arrow with a question mark. 


L however propose to undertake a re-presentation of the grammatical 
system in a different framework, which I call a formal framework. This 
new framework is introduced in chapter 3. Chapter 4 provides a formal 
representation of Astadhyayi in terms of statements that are formulated 
in the new framework. Finally, chapter 5 provides the algorithms for a 
possible computer implementation of the statements of the Paninian system 
of Sanskrit grammar. 


Chapter 2 
System 


One of the prime goals pursued by the ancillary disciplines associated with 
the Vedas is retention of a given phenomenon.! By a given phenomenon, I 
mean any existing linguistic or cultural practice established over a number 
of generations. It is something which one has received as the standard and 
would like to protect it and pass it on to the next generation—for example, 
the recitation of the Vedic mantras, performance of rituals or linguistic usage. 


In order to achieve the goal of retention, these disciplines also follow a 
common systematic approach which consists of two interdependent and 
complementary processes. The first one is analysis of a given whole into 
constituent components. The other one is synthesis through (rule based) 
combination of components to regain the given whole. 


analysis 
pasar om 
d TEE 
components whole 
synthesis 


For example, the continuous recitation (samhita-paátha) is analyzed into a 
word-for-word recitation (pada-patha) and the Pratisakhya texts provide 
a rule based synthesis from word-for-word recitation to the continuous 
recitation. Similarly, the Sulbasütras provide the rules for preparing the ritual 


1 The six ancillary disciplines associated with the Vedas are said to be phonetics (Siksa), 
prosody (Chandas), etymology (Nirukta), grammar (Vyakarana), instructions on ritual prac- 
tices (Kalpa) and astronomy (Jyotisa) (Gonda 1975 p. 34). For a summary of the literature on 
phonetics and grammar, see (Scharfe 1977), on ritual practices (Gonda 1977) and for astron- 
omy (Pingree 1981). 
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altar. The building blocks of the geometrical figures (squares, rectangles, 
triangles etc.) are gained through the process of analysis. Similarly, once the 
individual steps of a particular ritual are identified, these are arranged in 
the $rautasütras and one can re-constitute the ritual by following the rules 
mentioned there. 


This apparently cyclical exercise comprehends a given phenomenon in 
a systematic manner and gives rise to an interconnected structure of compo- 
nents. Such structures have the tendency to last longer and are explained on 
the basis of the underlying system. Moreover, structures facilitate variations 
and change. 


In section A.1 of the appendix, I have worked out a few examples to 
show the details of these processes of analysis and synthesis. In this chapter, 
we will focus on the special case of linguistic expressions and the grammatical 
cycle. 


2.1 The grammatical cycle: analysis and synthesis 


There is a consensus among Sanskrit grammarians, both ancient and modern, 
that grammar involves primarily an analysis of a given linguistic expression 
into its constituent components. Analysis follows the process of concurrent 
presence (anvaya) and concurrent absence (vyatireka).” The process of analysis 
is called anvakhyana and it has two stages: 


1. Vakyavibhajyanvakhyana is analysis of sentences (vakya) and identifying its 
component words (padas). For example, analysis of the continuous recita- 
tion of a Vedic mantra or a Sanskrit sentence into constituent words. 


2. Padavibhajyanvakhyana is further analysis of individual padas in its con- 
stituents. This stage of analysis yields more granular components. 


For example, the following sentence having two padas : 


2 The term vyakarana for grammar, whose etymological meaning is analysis, reflects these 
processes. It is used in the sense of separation of things in (SB. 1.7.1.4) and (KSS. 4.2.7-9), in 
the sense of discrimination between something, for example, between satya and anrta as in 
(VS. 19.77), as something which was previously one single entity, that was not previously 
differentiated, becoming or being made differentiated in (BU. 1.4.7), or it can also signify 
the act of dividing, e.g. in (SB. 3.3.1.13) and finally in the sense of making something clear 
in (YSB. 3.17) or Vyasabhasya on (YS. 3.17). George Cardona (1999 p. 564-576) provides a 
detailed exposition on the etymology and uses of vyakarana as well as other expressions for 
grammar. 

3 R. S. Bhattacarya (1966 p. 212-237) discusses these processes in detail. See also (S. D. Joshi 
1968 p. xi). 
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balakah pathati (a boy recites) 
is analyzed into constituent components such as: 
balaka, s(u), path(a), (&)a(p), ti(p). 


The rules of grammar, however, establish a correlation between the analyzed 
components with the given linguistic expression. This is achieved by com- 
bining the components in a controlled manner to obtain the linguistic expres- 
sions. The grammatical system therefore not only undertakes an analysis of 
a given whole into components, but also consists of the complementary pro- 
cess of synthesis of the analyzed components to regain the original expression. 
This, in a way, is a cyclical exercise. 


analysis 
balaka, s(u), path(a), (&)a(p), ti(p) balakah pathati 
synthesis 


A given linguistic expression is first analyzed into constituent components 
and then re-gained by combining the components. 


The process of synthesis is rule based. Rules lay down the constraints 
regarding the choices and manner in which the components should be 
combined. They prevent haphazard and unrestrained combinations and 
endorse the claim of the grammarians that the components are constituents of 
the original expression. The choice of components is vindicated by successful 
connection between them and the original expression. 


In contrast to the rule-based nature of synthesis, there are no rules that 
bring about analysis of a given expression into constituent components.* 
The components, however, are chosen in a manner so that they lead to the 
original expression once combined according to the rules of synthesis. The 
constituent components are stipulated by the grammarians for the purpose 
of grammar and exist within the grammatical system. They are not found in 
common usage and are, so to say, imaginary (kalpanika) i.e. improvised by 
the grammarians for their employment within the grammatical system.? 


^ There are a few exceptions in the Pratisakhya texts where such an attempt is made. See 
section A.1.3. 

5 The grammatical components exist in the grammatical system (Sastra) only and not in the 
common usage (Joka). Further, they are not established on the basis of usage (lokatah siddha) 
(R. S. Bhattacarya 1966 p. 213-214). Nage$a Bhatta in his Vaiyakarana-siddhanta-parama- 
laghu-mafijüsa remarks the imaginary nature of the components that are improvised by 
the teachers and employed only within the grammatical system (K.D. Shastri 1975 p. 7). 
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Moreover, the grammatical components are dependent on their parent 
expressions. This would be evident through an example Vyasa provides in 
his commentary on Yogasütra (3.17). The point he is noting there is that it 
is important to analyze the individual words further in order to determine, 
whether a word like bhavati, asvah or ajapayah denotes an act or a karaka.° Here, 
bhavati can be either vocative singular feminine of the respectful pronoun 
bhavatu, and in this case consisting of the components bhavati-s(u),’” or 
third person singular of the present form with components bhü-($)a(p)-ti(p). 
Similarly, a$val can have a$va-s(u) meaning “a horse", or second person 
singular aorist form with Svi-a(n)-si(p) meaning “reached” and ajapayah 
referring to an object (goat's milk) has ajapayas-am or the second person 
singular imperfect causative verb form of ji having ajap-i-a-s. These examples 
show that the components are dependent on the parent expression or the 
given whole. If the given whole signifies an act, then the components are 
different from when it denotes a karaka. 


Not only are these components imaginary and dependent on their par- 
ent whole, they are also not unique. In other words, there may be more 
than one analysis of the same original expression depending upon the 
grammatical tradition. As an example, consider the expression avati (leads, 
brings to) which is derived using the root av according to the grammar of 
Whitney (1885 p. 4), but the root u(n) according to the Paninian system. This 
implies that there can be more than one grammar of the same language. 


The process of synthesis is provided through the rules of the Astadhyayr. 
Without entering into the details of derivations, it suffices here to mention 
that the components are combined to create the given whole? Apart from 
that, the rules of grammar may also change depending upon the standard 
usage which they are supposed to account for. 


Although the stipulated components are non-real—in that they come into 
existence through the grammarians and exist within the realm of grammar— 
this is not the case with the given linguistic expressions themselves. They are 
established (siddha) on the basis of their abiding usage among people.? 


€ The word karaka literally means “doer of an action" and in grammar used in a technical 
sense as "instrument of action" (Abhyankar 1986 p. 118). There are six karakas. For details 
see (Cardona 1974 p. 231-306). 

7 An example Ha NTR | comes in the commentaries to Paraskaragrhyasitra (2.5.2-4) and 
Manusmrti (2.49). 

8 S. D. Joshi (1968 p. ix-xi) notes this complementary character of the analysis (vibhajya an- 
vükhyana) and synthesis, combination or integration (vrtti). 

? R. S. Bhattacarya (1966 p. 228-237) summarizes by noting that the constituent compo- 
nents of a Sanskrit expression are non-real (kalpanika), the rules of grammar (upaya) are not 
uniquely fixed (aniyata), but the linguistic expressions (upeya) are established (siddha). 
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Thus, the grammatical system as a whole consists of the complementary 
processes of analysis and synthesis. Any given standard linguistic expres- 
sion (henceforth represented by the symbol x) is analyzed in constituent 
components (henceforth p;). Astadhyàyi is a collection of the constituent 
components p; together with the rules of synthesis. One can represent this 
cyclical process schematically as follows: 


analysis 
p Sra 
set” r7 
Led ~ 
k ~ 
° E CN 
synthesis 


2.1.1 Why grammar? 


A pertinent questions arises here: Why should this cyclical procedure be 
undertaken at all? If grammatical processes involve regaining the original 
expression after one has analyzed it into components, then what is the use of 
such an exercise? In other words, what purposes are served by developing 
such a system? 


The passages in the Paspasahnika—both the Varttika of Katyayana and 
the Bhasya by Patafijali—are explicit as to the purposes of grammar. 
Katyayana enumerates five: 
(i) raksa@ or safeguarding the Vedic texts as well as standard usage of the language (ii) 
üha or suitable adaptation and modification of a mantra according to the requirements 
of a particular ritual (iii) gama or complying with Vedic injunctions (iv) laghu or econ- 
omy of effort and (v) asamdeha or absence of doubt as to the standard expressions are 
the purpose.’ 


The above enumeration of Katyayana and further explanations of Patafijali 
point out that the main purpose of developing the grammatical system is to 
safeguard the standard usage and effectuate acceptable modifications. 


Both the non-Vedic common usage (laukika), as well as Vedic corpus, 
are sought to be preserved by the grammar. The Vedic corpus is a clearly 
delineated collection, and as to common usage, Paniniyas recognize certain 
model speakers (the sistas) who set the standards. Patafijali in his Mahab- 
hasya on 6.3.109 provides several characteristics of the model speakers on 


10 THRACE: Heller! (PV. 2). 
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the basis of their knowledge of the grammar as well as their dwelling place 
and behavior. 


These model speakers are brahmanas who dwell in the country of the aryas where 
alone their exemplary behavior is found. The brahmanas who dwell in this abode 
of the dryas and have only as much grain as they can carry in a small pot, are not 
greedy for honor, following established rules of correct behavior without having to 
be given any immediate cause for this, and who have attained total expertise in some 
traditional area of learning without explicit instruction, these honorable ones are the 
Sistas." 


What is meant by correct speech or standard linguistic expressions, is the 
way these model speakers would speak. 


The next question is how to safeguard standard usage. Patafijali's search 
for an answer to this question sheds important light on the nature of the 
grammatical method. The discussion on this point is in (PB. 46-55) and can 
be summarized as follows. 


One obvious way to retain the standard speech forms would be either 
to exhaustively specify them by uttering them one after another, or to stip- 
ulate the complementary set of non-standard forms. Both these options are 
to be discarded. It may be possible to retain a limited collection of utterances 
(as in case of the Vedic Samhitas) but impossible for the set of expressions 
of common speech as it is too large.? Patafijali finally suggests providing 
a description of this big collection of standard expressions by pointing out 
their general and special characteristics and recording them in a systematic 
manner. His advice for attaining this is by putting constraints (niyama) that 
can distinguish between standard and non-standard speech. 


In order to understand the concept of constraint or niyama, it is neces- 
sary to look into an important aspect regarding the nature of a linguistic 
expression, namely the distinction between its form and content. In Pas- 
pasahnika (PB. 4-12) Patafijali looks into several suggestions regarding the 
exact nature of an expression like gauh (a cow). He mentions and rejects gauh 
to be substance (dravya), action (kriya), quality (guna) and generic property 
(akrti).^ Finally, he characterizes expressions like gauh to be “that, which 
when uttered brings forth an understanding of an object with dewlap, tail, 


1 qd qfé ated SING | a TIR aaa aR TREE À aren: SRT SYST MTA- 
HUER: HAG BAMA: MORANA: RET: | (Transl. Cardona 1997 p. 552). For a 
detailed discussion about the Sistas, see also (Aklujkar 2004 p. 713-717). 

? For details see (Joshi and Roodbergen 1986 p. 70-78). 

3 Patafijali tells here the story of Brhaspati and Indra who started in this manner and gave 
up owing to the large numbers. If it was impossible for the divine beings, then even more 
so for the mortal human beings, see (P. 50-51) (Joshi and Roodbergen 1986 p. 74-75). 

4 See (Joshi and Roodbergen 1986 p. 12-17) and (Ganeri 1995 p. 4-10). 
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hump, hoofs and horns".? This definition by Patafijali points towards an im- 
portant distinction between the form and content of any linguistic expression. 


The form of an expression is its phonetic part, which is uttered and which, 
when heard, produces an understanding of its content or meaning. 


Panini and the Paniniyas distinguish between the form of an expression 
and its content. The term commonly used for the form is rapa and for 
the content or meaning, artha is employed.!é The distinction between the 
phonetic form of a grammatical component and its content is explicitly stated 
in the rule: soam rüpam sabdasyasabdasamjna!’. Here, Panini specifies that 
"[A] word (in a grammatical rule) which is not a technical term denotes 
its own form".'? For example, if a component like agni is mentioned in the 
grammar, the grammatical operations are applied only to the own form (svam 
rüpam) of agni, namely the phoneme strings / agni / and neither to the actual 
thing meant by it, nor to any other linguistic form, like pavaka, that can be 
used for the thing meant. This is not the case for technical terms occuring 
in grammar. The operations are applied not on the phonetic form of the 
terms themselves, but on the actual constituent components for which they 
stand. For example, a mention of nistha implies the components (k)ta and 
(K)tavat(u).? In a rule like sphayah sphi nisthayam?? where the substitution 
is to be applied on the condition that nistha follows, what is meant is that 
if the suffixes (k)ta or (k)tavat(u) follow and not the phoneme string / nistha /. 


Similarly, in the rule: sarüpanamekasesa ekavibhaktau?! Panini provides for 
the remainder of only one among those which have the same form. As 
explained in Kasikavrtti, it is important here to differentiate between the 
form and its content. The rule applies only when the form is identical, and 
not when the content is same and form may be different.” 


5 eee seq: | Tel SARA SATS HAGA eret AA E are: | (P. 13-14). 

$ The expression rapa in the sense of form is used more than once in Astadhyayi including 
1.1.68, 1.2.64, 2.2.27, 3.1.94, 6.1.94. The expression artha for meaning or content is employed 
more frequently (around 80 times). Some examples are: 2.3.46, 1.1.19, 1.4.19, 1.4.85 etc. 

7 ef wd seges !12.?.&€1l » an expression which is not a technical term denotes its 
own form. 

8 Transl. John Brough (1951) reprinted in (Staal 1972 p. 403). For a discussion on this rule 
see (Brough 1951) reprinted in (Staal 1972 p. 402-414), (Scharfe 1971 p. 40) and (Staal 1975 
p. 331 ff.) and more recently (Scharfe 2009 p. 182-196). 

9 vibe FT 2.2.28 Il > (k)ta and (k)tavat(u) are nistha.. 

20 rea: EA ABATA, 1 8.2.22 ll > sphay(1) is replaced by sphi before nistha. 

21 &RENTUEEGSIS. Waa 2.2.88 ll > of those with same form, only one remains, in case of 
single vibhakti. 

22 TETAN aH? J-A: | KV on 1.2.64. Why it is stated that "having the same form"? 
To exclude cases like Plaksa and Nyagrodha, which have the same meaning but different form. 
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In other words, the form of a technical term does not contribute to the 
formation of intended expression, but only its content, namely the compo- 
nents to which it refers. In the case of a grammatical component, however, 
both—its form as well as content—become important and any mention of 
it in the grammatical corpus refers to both. When, however, only content 
or meaning needs to be stated, then the expression iti is used after it to 
indicate that in this case, not the form but only its content is to be taken 
into consideration.” For example in the rule: tasminnitinirdiste pürvasya^^ the 
expression iti after tasmin indicates the locative case meant by it and not the 
phonetic form / tasmin / . 


The expression artha is used frequently by Panini to denote content or 
meaning of grammatical elements. One of the conditions for a component 
to be pratipadika or nominal stem is that it should be arthavat or “having 
some meaning" 2? He also employs the expression artha when the content of 
some component is important for grammatical operations. As an example: 
matvarthe?? or trtyarthe?" when one wants to express the meaning conveyed 
by the suffix mat(u)(p) or the content of trtiya. 


How is the form of any expression related to its content? According to 
Katyayana : "there is an established (siddha) relation between a linguistic 
expression (Sabda) and its meaning (artha)" ?9 From this, it follows that the 
relation between the form of any linguistic expression and its content is 
"established". Further, the fixed nature of this relation is “on the basis of 


Incidently, the two trees mentioned here Plaksa (Ficus infectoria) and Nyagrodha (banyan, 
Ficus benghalensis L.) belong to the same family of fig trees, but are not same. 

23 Kaéikavrtti on 4 8f FRU Ul 2.2.22 Il: gRs Aam: | The employment of iti is to denote 
the meaning. 


[aS a a aS a. in’ c 


?4 GAT Ace FAC il 2.2.88 ll > locative case indicates that the grammatical operation is to 
be applied to the immediately preceding component. 

25 HIATT GAA: TATE NK > a meaningful component which is not a dhatu and 
not a pratyaya is pratipadika. 

26 aT ATA 2.8.23 > components ending in t or s before a pratyaya and having the mean- 
ing of mat(u)(p) are called bha. 

27 FATA 12.4.¢4 ll m anu is assigned karmapravacaniya when it denotes the meaning con- 
veyed by trtiya. 

28 firi agia | (PV. 3). My translation of siddha as ‘established’ is to point to the estab- 
lished nature of usage of words in connection with specific meanings. Patafijali discusses 
the use of the expression siddha and suggests that what is meant is nitya (permanent or 
eternal) and clarifies that nitya here is not necessarily in the sense of something which is 
unchangeable (kitastha) or immovable (avicalin), but rather something that becomes estab- 
lished through continuous repetition (abhiksnya) (P. 63-71). He further tries to justify the nitya 
(permanent) character of this relation and sums up by pointing towards the irrelevance of 
this discussion in the present context of mentioning the purpose and need of grammati- 
cal instructions (P. 72-79). For Grammarians’ discussion with the Vaisesikas on the relation 
between sabda and artha, see (Houben 1992). 
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its usage among the people". The point is explained by Patafijali in the 
following words: 
People, in their common usage, after having brought to mind respective meanings, 
accordingly use linguistic expressions. They do not make any extra effort to make or 
generate these expressions (from their meanings). On the other hand, they do put in 
extra effort to make things which are to be produced. For instance, one who needs a 
pot for some purpose, goes to the house of a potter and says: "You make a pot. I need 
a pot for some purpose". One who wants to use linguistic expressions does not go to 
the house of a grammarian and says: "You make linguistic expressions. I want to use 
them". He uses it according to the meaning he wants to express.?? 


Acknowledging the distinction between the form and content of any expres- 
sion, I propose it as the basis for differentiating two levels in the grammatical 
system: the form-level and the content-level. Any linguistic expression like 
gauh has some phonetic form / g au h / and content gauh or a cow. This can 
be depicted as in the following figure. 


gau h form-level 


acow  content-level 


Here the form-level is depicted above the content-level. The connection 
between the phonetic form / g au h / and its content a cow is established 
through usage and is represented by the connecting line. 


The above observations can be formulated in general terms as follows. 
Let x be any given linguistic expression. If zf denotes its form, z"' its content 
and the continuous straight line the fact that their connection is established 
through usage, then it can be pictorially represented as in the following 
figure. 


af 


am 


If the relation between a linguistic expression and its meaning is established 
through usage, i.e. if people are the authority with regard to these, then the 


29 dd: | (PV. 3) (Joshi and Roodbergen 1986 p. 115-116). 

30 qk A eA a ust eat aa Hale AGA: Bar: Alar: feed Tas uer feudi 
TAM Fed BH RRAGA Nel AE HS TA! HAA BRIA | A Tae TATA: d- 
FROGS Tea SE FE META! TH sa Aaa Welz | (P. 81). Translation (with 
minor adaptations) from (Joshi and Roodbergen 1986 p. 115-116) which includes a detailed 
discussion. See also (Scharf 1995 p. 74) for a slightly variant interpretation. 
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obvious question arises as to what then is the function of grammar. This is 
exactly the next question raised by Patafijali : 


If the people are the authority with regard to these (i.e. the relation between meaning 
and the corresponding linguistic expressions), then what is the use of grammar??! 


Katyayana provides an answer by introducing the concept of constraint or 
niyama. 


Given that linguistic expressions are employed in accordance with their meaning 
fixed on the basis of its usage among the people, constraint (niyama) is specified 
through grammatical instructions for the sake of dharma. And this is similar to the 
specifications of the constraints in case of instructions about non-ritual as well as 
Vedic ritual actions.? 


The general idea can be stated as follows: given a number of possible options, 
niyama is constraint or restriction which can be applied to distinguish and 
select certain specific options out of the various possibilities.?? In this context 
the following examples by Patafijali from both the Vedic as well as non-Vedic 
realm are illustrative and worth reproducing in full detail. 


With regard to common matters (loka) it is said that one should not eat tame cock or 
tame pig. But what is food (bhaksya) is taken to satisfy hunger and following this it 
could also be possible to satisfy hunger by eating the meat of dog etc. With regard 
to this, a restriction (niyama) is made, namely, this is eatable and this is not eatable.** 


Similarly, desire for a woman is because of sexual urge. Satisfaction of sexual 
urge is possible equally with a woman suitable for intercourse or not. With regard 
to this, a restriction (niyama) is made, namely, this is suitable and this is not suitable.? 


In Vedic instructions as well, it is said that a brahmin observes the vow (vrata) 
of living on milk, a ksatriya on gruel and a vaisya on indian cottage cheese (amiksa). 
A vow however is for the sake of taking food. It is possible to observe the vow by 
using rice and meat as well. With regard to this, a restriction (niyama) is made.?ó 


Similarly, it is said that the post for tying the sacrificial animals should be ei- 
ther of the bilva wood or of the khadira wood. Such a post is for the sake of tying the 
sacrificial animals. It is possible to tie the animal with any wooden post, standing or 
lying. With regard to this, a restriction (niyama) is made.” 


31 afe af Gren: ag sii f amavi RT | (P. 82). See also (Joshi and Roodbergen 1986 p. 117). 

32 cimdissirqu RSA SST TATA: | SEIT AIEE EIEEEI] (PV. 4-5). For a detailed discussion 
on this topic, see (Aklujkar 2004 p. 687-732). 

33 Paul Thieme (1931 p. 23-32) discusses this point on the basis of the beginning sütras of 
Vajasaneyi-pratióakhya. 

24 gi Aad Shea: MAGEE: STR: STRA: Ef Gud | ed UTR AAA sada | aed 
eres aaah: erf aperereequi | aa feret: fos | gd wea game aki (P. 84). 

55 qur GTA eil R: ate | TATA: SETH: TATA A STATA SII a PA: foe | gd TET SET 
af (P. 84). 

36 qd vq; aÀ Tala: AT: TARA: Tata: MARAT: Se: ef qug | ad Va STGTTERTSP TaN | 
are aris areata ema satu aa fran: fad | (P. 85). 

97 aa ea: a: AT FT: cand ll gud! qucd Wm WaT grid | ae sra fda «Iud. 
BGA AAPA TW WY: AGAL qa ferm: [nud (P. 85). 
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Similarly, when the potsherds have been put near the fire, the Vedic mantra : 
bhrgünam angirasam gharmasya tapasa tapyadhvamiti [be you heated with the heat of 
the sweat of the Bhrgu's and the Angirasas] is recited. Even without the formula, 
the fire, whose nature is to burn, heats the potsherds. And with regard to that, a 
restriction (niyama) is made, namely, when it is being done in this way, it leads to 
bliss in the form of heaven (abhyudaya).38 


In the same way here also, when meaning can be understood equally from 
the standard expressions (Sabda) and non standard expressions (apasabda), a re- 
striction (niyama) is made for dharma, namely, that meaning is to be conveyed by 
standard expressions only and not by non-standard expressions, as usage in this 
manner leads to bliss in the form of heaven (abhyudaya).?? 


The above quote validates the assertion that the main aim of the ancillary dis- 
ciplines, including grammar, is to preserve the given standard phenomena. 
The given phenomena may be the admissible objects for eating, or relation- 
ships with others, the way some ritual action is performed, or some linguistic 
expression is uttered. The systematic approach of these disciplines is not 
aimed towards generating rule based constructs, but to place constraints in 
order to select some preferable possibilities, from among the several existing 
options. 


For example, in order to express a cow several expressions like gauh, 
gavi, gorii, gota, gopotalika etc. are possible. From among them, only gauh is 
according to the usage of model speakers, although as Kaiyata mentions that 
“in common usage, non-standard expressions (apasabda) are also used and 


convey the same meaning as the standard expressions (sadhu-sabda)" .*° 


To summarize: The main task of a grammatical system is to ensure the 
retention of the standard usage of language. In other words, retention of the 
collection of the standard linguistic expressions. it is attempted to ensure this 
by placing constraints so that the inadmissible expressions are excluded and 
only the admissible ones are included. 


38 qq arit eae Tha asa | POT aaa unica quer quon afe | Ste eif rem arat: 
TAE HIST erede | Ta ferat: faac ed fac nR wafer fett (P 85). 

99 eee SETTAB ARTE t efferre: frd | rede ua arl: fpa: a aA en] d 
frr npa vafa aft (P. 86). 

^0 sl Ren seres aS: | (PB. 49). Saat È SPE psa, MENI- 
«meist! (MBP. 3.1.8). 
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2.2 Grammatical synthesis 


It is time now to focus on the rule-based process of synthesis which we de- 
picted by the following diagram. 


synthesis 
Pi =——— T 


Here p; represent the components gained by analyzing the linguistic expres- 
sion z. Examples of the components include: 


balaka, s(u), path(a), (&)a(p), ti(p). 


The sub-script i in p; is the general variable which stands for natural numbers. 
Thus, one can refer to the above five components as: 


pı : balaka, p» : s(u) etc. 


The task of grammar would be first to stipulate them and then to provide 
for their appropriate combinations. The combination must result in the 
given expression. A simple process of combination is the conjoining of two 
components one after another. More complex processes involve replacement, 
elision, augmentation and reduplication. 


If Xp; represents the combination of the constituent components, then 
the process of synthesis can be depicted as follows: 


Mp; => T 


The above figure says that, apart from the constituents p; of any expression z, 
the grammar also provides rules of their combinations i.e. X which when 
applied to the components, leads to the standard expression x. 


Thus, the task of grammar is two-fold: firstly to enunciate the compo- 
nents p; and secondly to provide for their combinations X so that the 
combined components result in the standard expression x. 


2.2.1 Form and content of a component 


Like linguistic expressions, the constituent components p; also have a form 
and some content. The form of any component consists of a sequence 
of one or more sounds. The next requirement is the specification of their 
content. This, I propose, can be comprehended through three basic categories: 
(i) the lexical meaning of a component (ii) the meaning-expressions that 
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are associated with a component in the grammatical corpus, and (iii) the 
characterizing attributes attached to the components. 


The lexical meaning is the semantic content inherently associated with 
a component. By inherent association, I mean that it is not explicitly men- 
tioned in the grammatical corpus. For example, the component balaka means 
boy and it is assumed that the user of grammar is familiar with the meaning 
boy that is connected with the phonetic form bàlaka. If we represent the 
phonetic form of any component p; by the symbol p/ and the lexical content 
inherently associated with it by p?", then this established association can be 
depicted as follows: 


pl: balaka 


d 


pi: boy 


Here, the nature of association is similar to that between the form of any 
linguistic expression z^ and its content x™ , namely established by usage. It 
is therefore depicted by an unbroken line. 


Paninian definition of nominal stems reflects this inherent association. 
Nominal stems are defined as those components that have an inherent 
lexical meaning—and not belonging to the collection of verbal roots and 
suffixes.*! Panini does not provide an exhaustive enumeration of the nominal 
stems. Instead he relies on the complement of finite sets of verbal roots 
and suffixes—which are enunciated in the grammatical corpus—and the 
condition that it must have lexical meaning. 


Not all components have inherent lexical meaning and in many cases 
grammar explicitly associates meaning-expressions to components. In the 
grammatical corpus, this association is usually in terms of the condition for 
introducing a component in the derivational process. For example, if the 
intention (vivaksa) of the speaker is to express present time, then the compo- 
nent l(a)(t) should be introduced.” In other words, the meaning-expression 
vartamüna (present time) is associated with the component l(a)(t). 


Al HIATT GAA: TEA NK > a meaningful component which is not a dhatu and 
not a pratyaya is pratipadika. 

2 qdWri GE W222 > to express present time, introduce l(a)(t). In Vakyapadiya 1.24 
Bhartrhari speaks of meanings associated to the components (apoddhara-padartha). These 
are not fixed but there are differences of opinion (vikalpa) as to which meaning-expression 
be associated with which component. In this sense, they are different from established and 
fixed meanings (sthita-laksana) of words. See (Rau 2002 p. 7) and (Cardona 1975 p. 280). 
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The user of Paninian grammar must be able to correlate her or his in- 
tention with the corresponding meaning-expression in the grammatical 
corpus and then decide whether she or he agrees with it or not. If, for 
example, the speaker wants to express present time, then vartamana is the 
corresponding meaning-expression. If the user agrees with it, then the com- 
ponent l(a)(t) can be introduced. This is how components are associated with 
meaning-expressions. The component l(a)(t) now carries the information 
that it is introduced when the user intends to express present time. Another 
example of this type of association is during the introduction of verbal roots. 
These are listed in the Dhatupatha with meaning-expressions that specify 
when a root is introduced.? A root like bhū would be introduced if the user 
understands satta (existence, being) and is certain that this is what she or 
he intends to express. As we will see in the following pages, Panini makes 
extensive use of meaning-expressions in his grammar. 


The meaning-expressions are represented by the symbol z7' . The let- 
ter x indicates that it is a Sanskrit linguistic expression, the sub-script j 
refers to any of the several such expressions in the grammatical corpus. 
Further, the super-script m indicates that it is the meaning or content of this 
expression which is relevant for grammar and not its form. The association 
of meaning-expressions z7' with the form of components pl can now be 
depicted as follows: 


pi: Ma)(t) 


N 
S 
\ 
S 
SN 


zy vartamüna 


The dashed arrow from z7’ to p! represents the introduction of the component 
having the form pf if the meaning-expression x7” is intended to be expressed. 


The third category is that of characterizing attributes which encapsu- 
late grammatical or semantical information. These are designated through 


25 There is a divergence of opinion among the scholars on the issue whether the meaning 
entries formed a part of the original corpus of the Astadhyayr as suggested by Johannes 
Bronkhorst (1981b p. 335-357) or were added later as maintained by Bruno Liebich (1919 
p. 47-53) and G. B. Palsule (1961 p. 91ff.). 

^ Paul Kiparsky and Frits Staal (1969 p. 84) use the term "semantic representations", 
Johannes Bronkhorst (1979 p. 150) uses the expression "semantic elements" and J.E.M. 
Houben (1999 p. 23-54) employs the phrase "meaning statements". My choice of the term 
is to emphasize their semantic character as well as the fact that these are not parts or com- 
ponents of the whole (hence "expression" instead of "element"). 
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several technical terms or safíjfias.?^ An attribute can embody some gram- 
matical aspect such as belongingness to a specific group of components. For 
example, Panini groups the suffixes (k)ta and (k)tavat(u) together and calls it 
nistha.*° In other words, he attaches the attribute nistha to these components. 


An attribute is represented by the symbol a; . The sub-script k indicates any 
of the several attributes that are specified in the grammatical corpus. The 
process of attachment of any attribute a, to the form of any component pf 
can be represented as follows: 


pf: (k)ta 


| 


ag: nistha 


The unbroken arrow in the figure above indicates the process of attachment 
of an attribute to some component. 


Attributes not only comprehend some structural or systematic charac- 
teristics, but in several cases they can also represent semantic aspects. In 
such cases, meaning-expressions are used to specify them. For example, 
the attribute kartr (agent) is defined in terms of the meaning-expression 
svatantrah or “that which is independent in performing an action"." There is, 
thus, an association of a meaning-expression xp with the attribute a; . This 
association, however, is deliberately made in the grammatical corpus.?? It 
is therefore represented through a dashed line. The following figure depicts 
this association. 


aj: karty ------ zy: svatantra 


The process of association of some meaning-expression to an attribute is dif- 
ferent from that of attaching an attribute to some component. In the above ex- 
ample, the first process would be association of meaning-expression svatantra 
with the attribute kartr and secondly, the attachment of the attribute kartr to 


45 There are numerous studies that deal with the Paninian technical terms particularly (Re- 
nou 1942), (Chatterji 1964), (Cardona 1970 p. 195-212), (Wezler 1976 p. 361-379), (Dvivedi 
1978) and (Singh 1979 p. 7-16). 

46 tampa FAST 19.2.38 Il» (k)ta and (k)tavat(u) are nistha. 

47 STA: Hl UN 2.4.48 ll P» that karaka which is svatantra or independent of others is kartr. See 
(Joshi and Roodbergen 1975 p. xviii-ix) for association of meanings or external information 
to the karaka terms. 

48 In the words of S. D. Joshi (2001 p. 156-157) : “Panini establishes links between grammat- 
ical notions and non-linguistic reality ...” Joshi sees the meaning-expressions as “semantic 
metalinguistic statements” which “are considered to be a device to assign interpretation to 
the linguistic structure”. 
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the appropriate component, for example, the nominal stem balaka. The two 
processes combined together can be represented as follows: 


pf: balaka 


aj: karty ------ gy svatantra 


The three possible categories at the content level of any constituent component 
can be represented collectively as follows: 


pi 


m m 
Di p cc—necme Tj 


The above figure depicts the general structure of any constituent component. 


1. 


The two levels are distinguished as the upper form level and the lower 
content level. A component p; has accordingly some form, which con- 
sists of a sequence of sounds and is depicted through the super-script f. 


Further, it may have a lexical meaning which is not deliberately speci- 
fied in the grammatical corpus. The phonetic form of lexical meaning is 
clearly the form of the component. This relation is similar to the one be- 
tween the form and content of any given expression. These relations are 
established through usage. Grammar makes use of this connection, but it 
does not establish them. This lexical content is represented at the content 
level by the symbol p?” with the super-script m denoting the lexical mean- 
ing or content of the component. The inherent or established nature of the 
relationship between pf and p?" is depicted through the unbroken line. 


Sometimes a component is introduced by specifying certain semantic con- 
ditions. These meaning-expressions that form a condition for introduc- 
tion of a component are represented by the symbol 27" . The dashed arrow 
stands for the fact that these meaning-expressions are deliberately men- 
tioned in the grammmatical corpus and form the condition for introduc- 


tion of some component. The form of meaning-expression is of no direct 
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relevance to the process of derivation. What is of relevance is the content 
that it represents. In many cases it needs to be completed with other terms. 
Meaning-expressions in the grammatical corpus are, so to say, indices that 
refer to some content a user is supposed to understand and make use of. 
Their understanding depends upon the user's knowledge of the language, 
grammar and the external world. 


4. The next category is that of attributes a; or names that are attached to 
any component. The attachment process is represented by an unbroken ar- 
row and is specified in a deliberate manner with the grammatical corpus. 
The fact that attributes are placed only at the content level indicates that 
their phonetic form is irrelevant for the process of derivation. Formally 
speaking, what is important is to have a unique index through which they 
can be referred to. What phonetic form this index has, is of no relevance 
for the grammatical process. 


5. Sometimes, the attributes are defined through meaning-expressions. This 
process is equivalent to associating a meaning-expression 2!" with an at- 
tribute a; which is depicted by dashed lines. 


The constituent components within the grammatical system of Panini can 
be represented in the above manner. It should be noted that the above 
representation makes use of the basic concept of two levels. At the form level 
the physical, audible sounds are represented and their content is represented 
through three basic categories. Further, four types of processes, depicted 
through four different kinds of connecting lines or arrows, are identified. 
Among them, only three i.e. (i) introduction of pl based on z7' (ii) attachment 
of aj, to pl and (iii) association of r7 to a, are specified in the grammatical 


corpus. The lexical meaning i.e. connection of p?” with the pf is not mentioned 
in it, but is taken to be established. 


As mentioned before, the prime aim of grammatical synthesis is to combine 
the constituent components in a rule-based manner, so that the form of the 
combination results in the form of the desired standard expression and the 
sum total of the content of the components corresponds to the content of the 
desired standard expression. If we represent the combination of components 
by the symbol X and distinguish the combination at the form level £f and 
content level X" then the grammatical process of synthesis can be depicted 
as follows: 
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The above figure is a more detailed version of the following figure which rep- 
resents that the synthesis is rule-based combination X of components p; that 
results in x. 


synthesis 
Pi — X 


The next task is to explain how X , or the process of combination, functions. 
In order to show this, I use an example to work out the grammatical synthesis 
and present it in terms of the categories and processes introduced above. 


1. Consider the Sanskrit expression balakah pathati (a boy reads) which is in- 
tended to be derived. We have: 


xf: balakah pathati 


x™: a boy reads 


Here, the connection between the phonetic form z/ and the meaning x” is 
established by usage. In other words, even without following the gram- 
matical process of derivation, someone conversant in Sanskrit would un- 
derstand what is meant by this expression. 


2. In order to derive this expression one must start by selecting the appro- 
priate constituent components that may be associated with the intended 
expression. 


a. The first selection would be the component p; : balaka whose lexical 
meaning is a boy. This amounts to saying that since the speaker in- 
tends to express p? : |balaka|| or ||boy|| the component with phonetic 
form pl : /balaka/ is introduced. This is represented as follows: 
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/balaka/ pf 
balaka pr 


It should be noted that the phonetic form of the component chosen 
is conducive to the phonetic form of the intended expression. Thus, 
components like māņavaka or bāla are not chosen, although they also 
convey the same meaning. 


b. The second component selected is p2 : path(a). In the Dhatupatha, the 
verbal roots are listed along with the meaning-expressions that con- 
dition their introduction. Thus, the verbal root path(a) along with the 
meaning-expression vyaktayam vāci implies that path(a) is introduced 
when “expressed speech” is intended to be communicated. Thus, in 
this case, the component with the phonetic form p/ : /p a th/ is speci- 
fied together with the meaning-expression x} : ||vyaktayam vaci|| or in 
case expressed speech is intended to be said. This introduction is therefore 
based on some meaning-expression and can be represented as follows: 


pj [path] 


x5 ||vyaktayam — vaci|| 


Again it should be mentioned that other components like lap(a) that 
are also listed with the same meaning-expression are not selected to 
avoid formation of lapati instead of pathati. 


Thus, the selection of the components requires prior knowledge about 
them, what they denote, their lexical meaning as well as the meaning 
associated in the grammatical corpus. Both their form as well as their 
content are to be taken into consideration. Further, the two components p; 
and p» need to be placed in this sequence, otherwise the expression pathati 
balakah may result instead of balakah pathati, which has the same meaning 
but a different form owing to different sequencing. 


The process of derivation can be considered as taking place through 
a series of changing derivational-states. At each derivational state 
a number of components, together with their contents are placed in a 
specific order. The derivational state after the introduction of p; and po 
looks as follows: 


40 2 System 
fonaka] pi pÉ [path] 
||balakal| pr x5 ||voyaktīāyīām — vaci|| 


3. At this stage, the process of characterization follows. It involves attach- 
ment of a number of attributes to the components, depending upon fulfil- 
ment of appropriate conditions. 


a. The grammatical corpus consists of a table of verbal roots or the Dha- 
tupatha. By the rule bhivadayo dhatavali'? Panini refers to it as “com- 
ponents beginning with bhü" and terms the components it contains as 
dhatu. Since the component path(a) is listed in the Dhatupatha, there- 
fore, an attribute a» : dhàtu is attached to ph : path(a). This process can 
be represented as follows: 


uU S 


pr dhatu: a» pu 


b. Next, the first component balaka is characterized as a nominal stem. 
The Paninian term for nominal stems is pratipadika. As mentioned 
previously, unlike the verbal roots, there is no extra list of nomi- 
nal stems in the grammatical corpus. Instead, they are specified by 
complementing the set of verbal roots and suffixes, provided they 
have some lexical meaning (arthavat). This is specified by the rule 
arthavadadhaturapratyayah pratipadikam™. Thus, the attachment of the 


attribute a, : pratipadika to the component př can be depicted as fol- 
lows: 
pi p3 
pr ai: prütipadika ag xy’ 


The attachment of the above attribute would include checking 
whether the concerned component has a lexical meaning and that it is 
not a verbal root or suffix. 


49 a Aaa: 2.2.2 ll > components in the list beginning with bhū are dhatu. 


50 HIATT GAA: TTA WK > a meaningful component which is not a dhatu and 
not a pratyaya is pratipadika. 
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c. The component bàlaka is also characterized as an agent. The Paninian 
term for agent is kartr. It is defined by the rule svatantrah kartta>'. The 
effect of this rule can be represented as the dashed line associating the 
meaning-expression x{ with the attribute a;. Here, to avoid further 
complex notations, I am using the same variable a; where the suffix 
only denotes that it is related to the first component p; and not the 
number of different attributes. 


Pi pi 
pr ài ----- vy az — cy 


kartr svatantra 


Further, the attribute kartr or agent would be attached to the compo- 
nents balaka if this is the intention of the speaker. 


E pl Ps i 
p 1 gaess oP l, E 


kartr svatantra 


In this manner, further attributes would be attached to the components at 
a particular derivational state until it arrives at a point of saturation, in 
the sense that no more attributes can be assigned to it. These processes of 
attaching attributes are therefore meant to saturate the current deriva- 
tional stage. 


4. Once a given derivational state reaches saturation, new components can 
be introduced to it, depending upon the fulfilment of grammatical con- 
ditions and the intention of the speaker. The next component which is 
added is the suffix l(a)(t). It is introduced when it is intended to express 
the present time.? A further decision which is to be taken is whether 
active voice is intended, and if this is the case, then whether the suffix 
l(a)(t) which is being introduced expresses agency? Moreover, as a suffix 
it must be placed after the verbal stem. This can be depicted as follows: 


51 qa: spl M 2.2.42 > that karaka which is svatantra or independent of others is kartr. 

52 adum BE 113.3.233 Il > to express present time, introduce l(a)(t). 

55g: BAR WIR AHA: 13.9.81 > lakara are used to denote the object or karman and 
the agent or kartr in the case of transitive verbs and after intransitive verbs they denote the 
action or bhàva as well as the agent or kartr. 
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balaka path l 
pi pj €—4 p$ 
S K 
P | m | À ^ Tri 
Pi a, --X, ag T2 T3 


There are two steps involved here. First, introduction of the component på 


on the basis of the meaning-expression x3’. This is depicted by the dashed 
arrow. The second step is placement of this component after the verbal 
root or the component pf, which is specified by the arrow with a tail. 


5. The next suffix s(u) is introduced after the nominal stem balaka and de- 
notes nominative singular case ending. For this, it must first be decided 
whether the suffix l(a)(t) associated with the verb expresses the agency. 
This implies that the attribute kartr or agent associated with the nominal 
stem is already expressed and therefore the first case ending can be intro- 
duced to it." In case, singularity is intended, the singular case ending s(u) 
is selected and introduced. 


balaka E path l 
p eri pj — — —— yi 
x | T |: [^ 
Dy — G---Zy age a ag a3 vy 


6. The following step involves introduction of the finite verb ending which 
replaces the suffix l(a)(t) after the verb. The desired component is the third 
person singular suffix ti(p). A number of decisions need to be taken and 
other grammatical constraints considered before its introduction. Since it 
is not in co-reference with a non-nominative,? the suffix l(a)(t) can be re- 
placed by one of the suffixes out of the tiv-group.°° Moreover, the fact 
that it is in co-reference with [ pı p4] implies that the replacement must be 


55 aed 93.28 > when not otherwise expressed. BIGEICCAR ICE (CIC CI: S- 
AAT 2.2.84 ll > to denote only the meaning of pratipadika, its gender, measure, and its 
number, prathama is introduced. 

55 BI: AJAA ANEAN ART 113.2. 239 Il > (S)at(r) or ($)ana(c) are introduced after a verbal 
root in place of l(a)(t) when the action is denoted at the current time and if l(a)(t) is not 
coreferential with a pada which ends in prathama or nominative case. 

56 GET 13.9.99 ll > in place of suffixes with cover term 1 namely l(a)(t), 1()(t), 1(u)(t), Kr)(6, 
I(e)(t), (ot), 1(a)(0), 1() (2), 1(u)(n) and I(1)(n) the suffixes coming in the next rule are substi- 
tuted. ferafegiferessesifirerenearatarmensteatirgienfew !3.9.9c ll > ti(p) tas jhi si(p) thas tha mi(p) 
vas mas ta atàm jha thas atham dhvam i(t) vahi mahi(n) are the substitutes of the cover term 
lor a lakara. 
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singular. Further, path(a) is a parasmaipada verbal root and neither first 
nor second person is being expressed, therefore, ti(p) is selected which 
replaces l(a)(t). 


balaka E path l ti 


pi — ppi ph py pi 


OBS NELW SS 


m m m m m 
Py Qj--T1 Q4 T4 ag Tə Q3 X3 a5 Ts 


In the above figure, the double arrow depicts the process of replacement. 


7. In the next step, an infix ($)a(p) is introduced after the verbal root and 
before the finite case ending. If the case ending suffix denotes agency— 
which is inherited from the suffix l(a)(t) which it replaced—and if it is a 
sarvadhatuka suffix” then the infix ($)a(p) is introduced. 


balaka S path a ti 
pl —— 5i Ph pss <p 
y MNA NÉ LRL 
* N N 
N * N N 
pi a -- LY a4 X3 Q2 X5 dg Xg a3 X3 a5 X5 


8. The next change follows the rule sasajuso rul??, with the substitution of 
r(u) in place of s. 


balakar pathati 


9. Finally, following the rule klaravasanayorvisarjantyal?? the phoneme r is 
replaced by visarjaniya because of the following khar (i.e. p) sound. 


balakah pathati 


From an operational point of view, both the above replacements are sim- 
ilar to the previous replacement of l(a)(t) by ti(p) in step 6. 


The process of derivation presented in the above example is slightly differ- 
ent from that suggested by Joshi and Roodbergen (1980 p. ix-xi). The main 
difference is that they start by associating the meaning-expressions with the 


57 This is because it belongs to the group called ti. fere fra MANTEA 103.9. 2*1 > tii and 
those having § as it marker are called sarvadhatuka. 
58 GGT T: 2.2.88 ll > final s as well as s of sajus is replaced by r(u) at the end of a pada. 


59 GRAMS: 2.3.24 > before khar or pause (avasana), r is replaced by the visar- 
janiya (= h) provided this r is final in a pada. 
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(karaka) attributes and introduce the constituent components like balaka or 
path(a) at a later stage. In my representation, the attributes occur together 
with the constituent components in a composite manner. There is no pure or 
deeper semantic or syntactic level, but the components are a complex of pho- 
netic form together with lexical, semantic and grammatical content. 


2.3 The derivational process 


The application of the grammatical system to synthesize a particular expres- 
sion is carried out in a number of steps. Each step can be said to correspond to 
some derivational state. Such a derivational state would consist of a sequence 
of components. Its detailed specification at the form and content level would 
then consist of respective categories. A general derivational state with two 
components is shown in the following figure. 


pl Py 
P 1 ^ Va T ET 
Pi ak -- Tj P?  Qk--Tj 


There are two components here, pı and pz, with their respective forms pf 


and ph and lexical contents pj’ and p3’. Further, a number of grammatical 
attributes a; or semantic content xi" can be attached to them. 


2.3.1 Guiding principles of synthesis 


The process of synthesis involves changes in the derivational state. A change 
could be at the content level, for example, when new attributes are attached 
or at the form level when new components are introduced. Any change in a 
given derivational state is guided by 


1. the intention (vivaksa) of the speaker, and 

2. the constraints of the grammatical system, which involves 
a. consistency of the derivational state 
b. its saturation, and 


C. its completion. 


6 See, for example, the stages A(1) to B(4) in (Joshi and Roodbergen 1980 p. ix-x). Brendan 
Gillon (2007 p. 451-461) follows a similar approach. 
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2.3.2 Intention of the speaker 


The intention of the speaker relates either to the meaning-expressions z7' or 
to the lexical meaning p?” of the constituent components. The grammatical 
system interacts with the speaker and gathers information about her or his 
intention through meaning-expressions. Intention is also related through the 
meaning of a lexical component. 


The meaning-expressions are explicitly stated in the grammar. A user 
must be able to understand them and react to them. For example, the 
meaning-expression vartamüna requires that the user understands what is 
meant by it—namely, present time—and decides whether she or he wants to 
express it or not. The lexical meanings, on the other hand, are not explicitly 
stated in the grammar. A user must be able to choose the right lexeme or 
component that corresponds to the lexical content she or he wants to express 
and input it into the grammatical system. This requires familiarity with the 
lexemes of the language. This is necessary because the form and content of 
such components are fixed or established through usage. 


It does not suffice to know only the meaning. Form is equally impor- 
tant. For example, if one wants to express balakah pathati, then one must select 
balaka and not bala or manavaka which also mean the same. Similarly, one 
must choose path(a) and not lap(a) although both are listed in the Dhatupatha 
in the sense of expressed speech (vyaktayam vaci). Otherwise, lapati instead of 
pathati may result. 


2.3.3 Consistency of the derivational state 


The rules of grammar provide for the consistency of a derivational state. 
The constraint of consistency regulates the distribution of the components 
and the categories expressing their content. At a given state, the sequence 
of components, their phonetic combinations and the presence or absence of 
grammatical attributes are specified in the grammar. 


As an example, consider the placement of suffixes (pratyaya). This is 
regulated by the rule which specifies that components with the attribute 
pratyaya must be placed after the component for which they are introduced.! 
Another example of the constraint of consistency is the presence of a par- 
ticular attribute in a component, which sometimes excludes the possibility 


61 Sea: 113.2.? ll > pratyaya are components introduced subsequently. WH 113.2.31l > and 
are placed after the components to which they are introduced. 
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of attaching certain other attributes to that component. Such attributes can 
be grouped within a set of mutually exclusive attributes. For example, one 
such set is (hrasva, dirgha, pluta]. If a phoneme unit is hrasva then it can 
not simultaneously be dirgha or pluta. Similarly, a component with attribute 
kartr can not simultaneously have the attributes karman, sampradàna etc. 


Only a consistent derivational state is admissible. 


2.3.4 Saturation of the derivational state 


Given a derivational state, a number of attributes can be attached to the 
components. Saturation of the derivational state is reached once no further 
attributes can be attached at that particular state. As long as some attribute 
can be attached to any component of a particular derivational state, it is not 
saturated. 


Attachment of attributes in general leads to grammatical characteriza- 
tion of components, as well as of the derivational state. They can be attached 
to the phonetic form of any component. This can be either a single sound i.e. a 
phoneme, or a sequence of them which may represent one or more constituent 
components. The first case has phoneme attributes and such attributes can 
be distinguished from other attributes, i.e. those that characterize phoneme 
sequences. Consider, for example, the component (k)ta appearing at some 
stage in the derivational process. The phoneme t is termed tu, khay, khar and 
a is ac, at, guna etc. On the other hand, the component (k)ta is called nistha, 
pratyaya, kit etc. 


There are many attributes, which once attached to a component, do not 
leave it. For example, the attribute ac for the phoneme / a /. This means, once 
attached, this attribute remains associated with that component in all the 
subsequent states as the derivational process advances. Such characteristics 
can be termed as static-attributes. On the other hand, certain attributes 
are assigned temporarily to some component. They depend upon the current 
derivational state and once its distribution changes, they no longer remain 
attached to that component. One such example is the attribute anga.© This is 
assigned with respect to some suffix and depending upon the distribution of 
suffixes in a particular state, it is attached to corresponding components, but 


8 In Astadhyayi this is specified under eka-sañjñā i.e. “assignment of only one term" con- 
straint prescribed by the rule: 3T HERI GH Gall 12.8.2 ll > upto a.2.2.38 only one sañjñā or 
attribute is attached (from among the set of mutually exclusive attributes). 

63 eqrercqafafaedara AASA 112.9. ?3 > that part which enjoins a pratyaya based opera- 
tion, before that the sequence is anga. 
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in the subsequent state things may change as may the assigned position of 
this attribute.5^ Other examples include ti® or upadha® that are assigned on 
the basis of current positions of the phonemes. 


Another criterion on the basis of which attributes are distinguished is 
whether some meaning-expressions are required for their assignment. If 
this is the case, then some kind of user intervention is required to interpret 
them and accordingly attach the attributes to the appropriate component. 
Consider the assignment of the attribute kartr, which is specified in terms 
of a meaning-expression svatantra.Ó? Here, the component which denotes 
the independent agency of action is assigned the attribute kartr. This 
involves assistance from the user. On the other hand, several attributes 
can be assigned only on the basis of distribution of system internal pa- 
rameters and without the help of the user. The grammatical system can 
decide on its own. For example, the attribute nistha can be assigned without 
consulting the user, solely on the basis of the presence of (k)ta or (k)tavat(u).5? 


Given a distribution of components, the derivational state must attain a 
state of saturation before new components can be introduced or changes 
can be effected at the form level. 


2.3.5 Completion of the derivational state 


A saturated derivational state is complete if no more components can be 
added to it. Neither the user, who interacts through the meaning-expressions, 
nor the grammatical system calls for addition of any component. Such a com- 
plete state must fulfill two conditions, namely (1) the phonetic form reached 
must be identical with the phonetic form of the intended linguistic expression 
and (2) the collective content of the completed state must correspond to the 
content of the intended expression. 


A derivational state, if incomplete, advances towards completion. This 
takes place when a new component is added to it. When it is placed adjacent 
to some existing component then it is a case of simple addition or augmen- 
tation (agama), and when it replaces some component then it is substitution 


64 James Benson (1990) discusses Patafijali’s remarks on aga. H. V. Nagarajarao (1978 p. 145- 
176) discusses the scope and necessity of the adhikāra of anga although Panini does not 
explicitly states its domain. 


65 ardisecarfa f& 12.2.21 > that part which begins with last ac is ti. 

66 ASAA TTA UN 2.2.44 > an al which is penultimate is upadha. 

87 AA: WA 2.4.48 > that karaka which is svatantra or independent of others is kartr. 
68 Hehe NSI 8.8.28 Il» (k)ta and (k)tavat(u) are nistha. 
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(adesa).? Augmentation or substitution of phonemes leads to change in the 
form of a component. Replacement of the entire components, however, not 
only leads to change in form but facilitates deeper layers of abstraction and 
generalization in formulation of grammar. The best example is the introduc- 
tion of abstract components like l(a)(t) and their complete replacement by 
case endings.” Panini specifies where the new components should be placed. 
For example, the suffixes are added after the component to which they are 
introduced.^! Components marked with t or k are added at the beginning or 
at the end of the constituents to which they are introduced. A component 
with m as marker is added after the last vowel of the component to which it is 
introduced.” Substitution involves a combination of two processes, namely 
the component which is replaced is made invisible by assigning an attribute 
replaced”, and the substitute is introduced at the appropriate position. 


2.3.6 Conditions 


The forces that induce some change in the derivational state are specified 
through conditions. The process of saturation of a derivational state, or its 
transition towards completion from one state to the next, depends upon these 
conditions. They are formulated in terms of the distribution of components 
and their contents in the current state, in the previous state and sometimes in 
the subsequent future state as well. The nature of conditions and their com- 
plexities will be discussed in the following chapters 3 and 4. 


© On the concept of substitution (adesa) and its possible origins in the Brahmana and Up- 
anisad texts, see (Thieme 1968 p. 715-723). See also (Wezler 1972 p. 7-20) on sthanin (sub- 
stituendum). 

70 For replacement techniques in Panini, see (van Nooten 1967 p. 883-902). See also (Joshi 
and Roodbergen 1975 p. xvii-xix). 

71 JAA: 13.2.2 ll > pratyaya are components introduced subsequently. WA 13.2.31l > and 
are placed after the components to which they are introduced. 

72 medi ZER 12.2.98 > a tit element is placed at the beginning and a kit at the end of the 
component to which it is assigned. 

73 EASA: 2.2.89 ll » a mit element is is attached after the last ac of the component to 
which it is assigned. 

74 This is a new attribute which I use, instead of lopa. It represents those units which are 
now replaced because of substitution. In normal application of grammar, sometimes it is 
retained in memory (e.g. replacement of lakaras) and sometimes it is safely forgotten (e.g. in 
phoneme replacements). 


Chapter 3 


Frameworks 


The corpus of Astadhyayi formulates the grammatical system specified in the 
previous chapter in a special framework. I will mention some of the salient 
features of the Paninian formulation in the following section. The present 
chapter, however, introduces a new formal framework which facilitates a 
re-presentation of the Astadhyayi. The framework I propose is different from 
the one in which the Astadhyayi is formulated. 


The most significant difference between Paninian and formal represen- 
tation is that of the medium. Without entering into the debate about whether 
script was known to Panini or whether he made use of it when compiling 
his grammar, based on the manner in which the grammar is formulated— 
e.g. the use of accents or phonemes as markers—it can be asserted that 
Astadhyayi is composed, transmitted and applied in an oral framework. 
Moreover, it is composed in Sanskrit, although it employs a number of special 
meta-linguistic conventions. A formal representation, on the other hand, is 
meant to render it in terms of logical relations that can be implemented on a 
computer. 


The reason for proposing a new framework is that the Paninian formu- 
lation, although special on several counts, is not adequately formal for a 
direct implementation on computer. Apart from its oral and verbal character, 
Astadhyayi is meant to be used by a person who has acquired skills to apply 
it through the study of grammar. The rules of grammar are not listed in 
the order in which they are to be applied for derivation of any linguistic 
expression. Instead, rules having common contexts are grouped together 
to avoid repeated and redundant enunciation of these contexts. A formal 
representation, however, is meant to be applied by a computer program with 
the aid of a user. For this, a mechanism needs to be developed by which the 
required information scattered in the grammatical corpus can be collected 
together using various indices encoded in a complex meta-language. 
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An important challenge in Paninian formulation is the presence of am- 
biguities, inconsistencies and the incomplete nature of the Astadhyayr. T 
will discuss some of these issues in section 3.2. While the commentaries 
and other explanations available in the later grammatical literature help in 
clarifying them, a device is still needed to incorporate these explanations 
and suggestions. Instead of directly modeling the corpus of Astadhyayi, the 
present approach models the categories and processes outlined in the pre- 
vious chapter about the grammatical system. It also facilitates incorporation 
of information necessary for application of the grammar but not explicitly 
mentioned in the grammatical corpus. 


3.1 Salient features of the Paninian formulation 


The formulation of the grammatical system in the Astadhyayi of Panini is in 
the form of concise statements—the sütras.! Typically, a sūtra consists of one 
or more inflected expressions. In the following, I shall call them elements of 
a siitra. These elements are inflected just like any other expression in Sanskrit. 
For example, in a sūtra like veh padaviharane? the element veli is the ablative 
singular of vi and padaviharane is locative singular of padaviharana. Panini's 
description of Sanskrit, therefore, is in Sanskrit. 


Yet not all the elements within the grammatical corpus belong to the 
common language. There are many elements that appear only in the gram- 
matical corpus. In the above sūtra, the second element padaviharane belongs 
to the common speech and the first one vel is an entity which exists only 
within the grammatical system. There is thus a clear distinction between the 
object language which is being described i.e. Sanskrit and the language of the 
Astadhyayt, which is a special language of description. From Panini onwards 
the tradition clearly recognizes two types of languages: the perennial utter- 
ances of the Vedas and the established language of everyday communication 
on the one hand and on the other hand the artificial language of grammatical 
instructions. The language of grammar is for the specific purpose of pro- 
viding a description of the standard usage and is artificially created by the 
grammarians. In this sense, Panini is not the creator (kartr) but the one who 
re-collects (smartr) the object language? 


1 Louis Renou (1963 p. 165-216) provides a detailed survey on the genre of sūtra in Sanskrit 
literature. See also (Staal 1992 p. 303-314) and (Wezler 2001a p. 351-366). 

28: qTafExUI I t.3.9 ll > àtmanepada suffixes are attached after kram(u) if it comes together 
with the preverb vi to express nice strides (padaviharana). 

3 A detailed discussion of the traditional points of view on this topic is provided by R. S. 
Bhattacarya (1966 p. 212-237). Hartmut Scharfe (1971 p. 2-6) also gives a summary of the 
main arguments. 
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The artificiality of the language of Astadhyayi also lies in the fact that it 
follows only partially the rules of grammar that it lays down. In many cases 
it is guided by a special set of rules, namely the meta-linguistic conventions. 
For example, although the elements of the grammatical corpus consist of 
inflectional suffixes like any other linguistic expression, the meaning which 
they denote is different compared to what they express in common usage. 
While the rules under karaka-section relate to common usage, in case of 
elements of the grammatical corpus several meta-linguistic conventions 
provide for other denotations. Thus, for example, the convention sasthi 
sthaneyoga@ specifies that the sixth case (sasthi) or the genetive case-ending 
denotes the place where a substitution takes place (i.e. the substituendum). 
The right-context is expressed through locative endings and this is specified 
by the meta-rule tasminnitinirdiste püroasya$. Similarly, the convention tas- 
madityuttarasya’ specifies that the left-context is denoted through ablative 
endings.? 


Considering the special meta-linguistic conventions followed in the for- 
mulation of the grammatical corpus, and the fact that several of these 
conventions are explicitly mentioned in the same corpus, the statements of 
Astadhyayi can be grouped into two basic types: 


1. Operational rules that introduce the grammatical components, character- 
ize them and provide for their combinations. 


2. Meta-linguistic conventions about the formulation of the operational 
rules. 


The first group deals with what is to be done and the second one with how it 
is presented or formulated in the Astadhyayt. 


In the previous chapter, I introduced three kinds of constituents of grammar: 
(i) the components of linguistic expressions (ii) the attributes that characterize 
them and (iii) the meaning-expressions specifying the semantic information. 
As mentioned before, within the grammatical corpus, these constituents 
occur in an inflected form. The combination of the inflectional suffix with the 
base follows the normal rules of suffix additions and phoneme combinations 


^ The section on karaka is from 1.4.23-1.4.55. For a detailed discussion on the karaka's see 
(Cardona 1974) and the Karakahnika of the Mahabhasya (Joshi and Roodbergen 1975). 

5 NT EAT 10 9.2.9 ll > sasthi stands for ‘introduction in place of’. 

6 aftafatarates qa I| $.$.8& ll > locative case indicates that the grammatical operation is to 
be applied to the immediately preceding component. 

7 ARATA 112.2.891l > ablative case indicates that the grammatical operation is to be 
applied to the immediately following component. 

8 It should be noted that the meta-linguistic conventions are not universally applied in a 
consistent and consequent manner throughout the Astadhyayr. See the discussion in sec- 
tion 3.2. 
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laid down in the Astadhyayr. For example, the attribute vrddhi is stated as 
vrddhih in the rule vrddhiradaic?. Similarly, the meaning-expression svatantrah 
in rule svatantrah kartta'? is mentioned together with the inflectional suffix 
for nominative singular. 


Rules of phoneme combination and compounding are applied as well. 
The components have a special presentational scheme. While the meaning- 
expressions and many (but not all) attributes are from the natural language, 
the components do not belong to the common usage. Their artificial character 
is reinforced by the use of marker-sounds that are often attached to them 
either at the beginning or at the end. Thus, in a component like ti(p), the final 
sound p is a marker. In this book, markers are pointed out by including them 
within brackets ( ). Similarly, the component ($)a(p) has a marker $ attached 
at the beginning and another marker p at the end. 


Apart from markers, the components sometimes have a vowel which is 
needed just for facilitating its pronunciation. Thus, n[u](m) consists of the 
vowel u which is not part of the component, but is added in order to be able 
to pronounce this component. Such extra vowels are noted within square 
brackets [ ] in this book. 


3.1.1 Methodology of Paninian formulation 


The elements of the grammatical corpus are threaded within a number of 
inter-connected sütras (lit. a string). A sütra constitutes a distinguishable 
unit of the grammatical corpus with a specific function." Although sūtra 
is a distinguishable functional component, it is not always an independent 
unit and in most of the cases several sütras must be read together in order 
to execute a grammatical operation. A significant Paninian device which is 
employed for this purpose is the carrying over of elements from previous 
to subsequent rules (anuvrtti).? Moreover, many rules operate within the 
domain set by what are known as adhikara rules. In this manner repetitions 
are avoided and enunciation of an element at one place suffices for its use 


9 arate 2.2.2 > vrddhi stands for at and aic. 

10 gaa: sail l9. 421 > that karaka which is svatantra or independent of others is kartr. 

!! Tradition speaks of six kinds of sütras based on their function: sañjñä (definition), paribhasa 
(convention), vidhi (injunction), niyama (restriction), pratisedha (prohibition) and adhikara (ex- 
panse). See (Abhyankar 1974 p. 432). 

12 S. D. Joshi and Saroja Bhate (1984) have comprehensively examined the principles behind 
such an organization of the grammatical corpus in a monograph titled "The fundamentals 
of anuvrtti" . 
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at other parts of the corpus.? The method of anuvrtti functions at the level 
of grammatical elements. In most of the cases it is only a part of the sūtra 
which is carried over to the subsequent rules. Consider for example the 
rule krnmejantah'*, which says that those expressions that end in krt suffixes 
having m or ec at the end, are termed avyaya. The term avyaya here is to be 
taken from the rule svaradinipatamavyayam® where it is first introduced. So 
a complete reading would be krnmejantah avyayam, where the term avyaya 
is collected from a foregoing rule. Not the entire rule, but only a part of it is 
carried further.!ó 


Since application of a grammatical operation requires elements from more 
than one sütras to be collected together, it becomes necessary to specify the 
boundaries of elements within the grammatical corpus. In fact, grammatical 
corpus can be considered as a sequence of elements where sütra-boundary is 
marked as well. 


The conciseness of a sütra is firstly because as a unit it is not always a 
complete and self-contained expression of what is to be said, but only a part 
of the whole statement. It needs to be completed by incorporating other 
information. The information lacking may be present in some other part of 
the corpus and must be fetched to make a complete and applicable statement. 
Sometimes the missing information is to be provided through conventions 
and interpretations not explicitly mentioned in the grammatical corpus. A 
sütra therefore is a unit within an inter-dependent and inter-related network. 


The organization of the grammatical corpus in terms of the sütras con- 
sisting of one or more elements, points towards the fact that the formulation 
of the grammatical corpus follows the same methodology as the grammar 
itself. As specified in the previous chapter, this approach is to analyze a 
given whole into constituent components and then provide a rule-based 
combination of the components to specify the whole. In the case of the corpus 
of grammar, the sütras (or parts thereof) are components that need to be 
combined with other components (i.e. with other sūtras or parts thereof) in 
order to fetch an applicable complete rule of grammar. How the different 
components of an applicable rule are to be gathered together is guided by 


13 H, V. Nagarajarao (1978 p. 145-176) discusses the scope and necessity of the adhikara of 
anga although Panini does not explicitly states its domain. The meaning-adhikaras in the tad- 
dhita section of the Astadhyayr are analyzed by Saroja Bhate (1987 p. 81-92). Ashwini Deo 
(2007 p. 1-37) shows that the taddhita section of the Astadhyayr is structured as a default 
inheritance hierarchy. 

14 FAA: 119.2380 > expressions that end with those krt suffixes which end in m or ec are 
avyaya. 

15 RAAE, ll 2.2.29 > svaradi (svar etc.) and nipata are avyaya. 

16 Tn his six volume edition of the Astadhyayr, Rama Nath Sharma (1990 p. ix-x) has pro- 
vided anuvrtti elements from preceding sütras. 
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a number of meta-linguistic conventions, as well as several interpretations 
supplied by the later Paniniyas." 


As mentioned in the previous chapter, grammar comprehends the stan- 
dard usage by analyzing and identifying the constituent components and 
then providing a rule-based combination of them. Similarly, the formulation 
of the grammar is achieved in terms of sütras (including their constituent 
elements) and together with conventions to synthesize the operational 
statements. 


The purpose of opting for the same methodology is also similar, namely 
to provide a compact encoding of the grammatical information as well as 
to safeguard it against future loss or corruption. Several techniques are 
employed to this effect. The most important method is to enunciate first the 
general characteristics and then to specify the exceptions.'® Components that 
share common grammatical characteristics are linked together. Two such 
groups are relatively large and enumerated separately. These are the list of 
verbal roots (Dhatupatha) and of selected nominal stems (Ganapatha). There 
are several sub-groups within these groups and they are frequently referred 
to in the main corpus.? Formation of groups is attained using marker 
sounds as delimiters which facilitate specification of sigla (pratyahara).° The 
it-markers as indicators that are directly attached to components serve to 
form groups as well. 


Meta-linguistic information is also encoded in the intonation of the com- 
ponents and sütras. Thus, according to the convention svaritenadhikaral?!, 
the circumflex accent (svarita) indicates that the particular sūtra (or a part 
of it) is a heading rule. Similarly, the verbal roots are specified with accent 
markers which aid in deciding the kind of suffixes that should be attached 
to them. For example, following the rule anudattanita atmanepadam™, the 
atmanepada suffixes are attached if the verbal roots are marked by a low 
pitched (anudatta) marker vowel. 


Panini also uses the principle of correspondence and instead of men- 
tioning individual pairs he correlates collectively two lists of equal length. 


17 Despite copious literature on this subject the task is by no means finished. For example, 
there are cases where scholars have divergent opinions as to the boundary of a sütra. 


18 Patafijali notes this in (P. 53), see (Joshi and Roodbergen 1986 p. 77). 


1 For example, the ajadi group is referred to in the rule AATA ll9.?.9ll » after ajadi 
components and those ending in at, suffix (t)a(p) is introduced to denote feminine. See the 
Ganapatha in (Katre 1989 p. 1265-1325). 


20 See section 4.1.1. 
21 vafe ER: 2.3.22 ll > through svarita a domain is marked. 


22 spart ACHAT 2.3.2 > after anudáttet or nit components àtmanepada suffixes 
are introduced. 
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The convention yathasamkhyamanudesah samanam” specifies this. The rule iko 
yanaci** is an example. Here ik = [i, u, r, I] and yan = [y, v, r 1] are correlated 
respectively. 


For the specification of it-markers, Panini makes use of rules. More- 
over, the names of the sets of phonemes (pratyāhāra) are generated in a 
rule-based manner. The convention adirantyena sahetā™® is used to coin the 
names of the collections in a generic manner. Thus, Panini not only attains 
brevity by using the acronyms or pratyahara instead of using the entire list 
every time, but he also generates their names using just one rule. 


The employment of several of the techniques mentioned above to achieve 
maximum compactness has the consequence that the grammatical corpus 
attains a complex structure. This requires commentaries and explanations 
to make explicit the encoded information in an unambiguous manner. The 
task of formalization of the grammar needs therefore to take this aspect into 
consideration. 


The formulation of the grammatical system does not furnish in an ex- 
plicit manner the process of derivation which involves operations that are 
carried out one after the other in a sequential manner. These operations 
are conditioned. To execute this process, one would ideally expect that the 
next applicable operation is unambiguously specified. It means that there 
is a unique operation and all the conditions that are to be satisfied are also 
available. A sūtra, however, is not always prescribing a unique operation and 
seldom does it have the complete set of conditions mentioned at one place. 
For example, the sūtra: kartuh kyan salopasca”’ provides for two operations: 
introduction of the suffix (k)ya(n) and elision of the final phoneme s. For the 
conditions other sütras like dhatoh karmanah samanakartrkadicchayam vi? and 
supa atmanah kyac?? etc. need to be taken into account.” 


23 RREA: CAPT I 2.2.26 ll > respective assignment for equal number of elements. 

24 gj ATTA 18.2.9 ll > ik is replaced by yan if it is followed by ac. 

25 The rules for it markers are: TISAI AGATA Sq 112.33 1l > in grammatical instruction, an 
ac which is anunasika is it. 85774, 2.3.31 > hal coming at the end. à RTE gem: uei il 
> but not in vibhakti, the tu, s or m. amare: 2.3.41 > the initial Ai, tu and du (of dhatu). 
q: IATA 2.2.8 ll > s of pratyaya. FE NWR » cu or tu. Balas We > 1, é and ku of 
non taddhita. 

26 RAA FEM 2.2.9? ll m an initial element together with the final it sound includes 
intervening elements. 

2 ad: AS HATA 113.2.22 ll > after kartr (K)ya(n) is introduced and the final s is elided (lopa). 
28 Tal: BAN: GATT AHU TTA 3.2.91 > after dhatu optionally in case its action is karman 
of some dhatu expressing desire and both dhatu have same kartr. 

29 Q4 ANAA: FIL 13.2.6 ll > after sup to express desire for oneself, (k)ya(c) is introduced. 

30 For details see (R. N. Sharma 1987 p. 46). 
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Here it is important to note that it is not that the grammar does not specify 
the conditions, but only that the specification through the sitra-style is such 
that these are distributed at different locations and need to be gathered at 
the time of application. This process of bringing all the requisite information 
together in one place is carried out by a human user on the basis of her or his 
knowledge of the Astadhyayi, especially the knowledge of its meta-linguistic 
conventions. 


In an article published in 1975, Rama Nath Sharma (1975 p. 31-39) dis- 
cusses this aspect. He suggests that there is an implicit device which works 
via reference to retrieve information necessary for the proper application 
of rules. He refers to the employment of domains and recurrences (adhikara 
and anuvrtti) as well as the use of technical terms, which necessitates the 
construction of what he terms as referential indices, so that the required 
information can be gathered for the application of rules. In the first volume 
of his six volume edition of the Astadhyayr he elaborates this idea further 
providing ample examples (R. N. Sharma 1987 p. 60-73). It suffices here 
to mention that his suggestions are largely an attempt to note down the 
(mental) process through which relevant information is gathered in order 
to apply the rules of grammar. It is clear that in a formal representation, 
which a computer should be able to understand and apply, this process 
needs to be made explicit. Although Sharma rightly points out its need and 
explains how it can be done following applicational procedure (prakriya) of 
Panini, he does not provide a practical framework in which it may be realized. 


In contrast to the suggestion put forward by R. N. Sharma (1975 p. 31), 
I propose that it is necessary to recast the content and processes of grammar 
in a new formal framework. The framework which I intend to introduce 
differs from the way in which the Astadhyayr is formulated by Panini. The 
prime focus of the new formal framework is to integrate the representational 
and applicational aspects of grammar. The rules, once recast in the new 
framework, can be interpreted and applied in an algorithmic manner and 
the derivational process can be carried out with the aid of computer systems. 
Another important aspect of the new framework is its non-oral or non-verbal 
character in a formal and unambiguous manner. This enables on the one 
hand the object and the meta languages to be clearly distinguished, and on 
the other hand facilitates computer implementation. 


The proposal I make to recast the Astadhyayi in terms of a new formal 
framework is significant with far reaching consequences for the organization 
and presentation of the grammatical corpus. An immediate question arises 
here: why should one attempt to represent the Astadhyayi in a new frame- 
work? Why not keep the Paninian formulation that evinces several formal 
features? After all, it follows a systematic and advanced meta-language 
which, to a large extent, is employed in an unambiguous and consistent 
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manner. Further, looking at the large amount of literature claiming the 
Astadhyayi to be the oldest and best example of formal representation, it 
seems an uninvited exercise. 


Despite several advantages—like safeguards against human frailty— 
achieved through concise, well knit, verbal aphorisms in which the Astad- 
hyayi is formulated, its complex meta-language requires an elaborate and 
comprehensive apparatus for interpretation and application of the grammat- 
ical corpus. The framework in which these are formulated is suitable for oral 
transmission and human application of the grammar, but not suitable for its 
formal representation and algorithmic application. In the following I will first 
discuss the problems of formalization of the Astadhyayr, and then introduce 
a new formal framework to represent it. 


3.2 Formalization of the Paninian corpus: challenges and 
possibilities 


There are certain practical challenges if one decides to keep to the verbal 
framework in which Astadhyayr is formulated. First of all, there is no critical 
edition of the Astadhyayr of Panini or other important earlier texts like 
the Mahabhasya of Patafijali—in which the Varttika of Katyayana are also 
embedded—and later texts like the Kasikavrtti of Jayaditya and Vamana. All 
the pioneering editions in the last two centuries are vulgate editions without 
stemma. The issue was recently raised by Michael Witzel in an animated 
post to one of the electronic discussion groups, where he bemoaned the lack 
of efforts in preparing critical editions of basic Paninian texts?! Responding 
to this scholars like Johannes Bronkhorst downplayed the expectations that 
such critical editions might fulfil, for example, to shed some extra light on the 
exact date of Panini or issues regarding the later additions of certain portions 
of Astadhyayi. Madhav M. Deshpande, notes that “[T]his debate concerns 
a state of the Astadhyayi that predates Katyayana and Patafijali, and no 
manuscript material takes us back to that period". Therefore, in his opinion 
“crucial questions of historicity of various sections within the Astadhyayi 
cannot be resolved with critical editions based on very late manuscripts.”°? 


3! See the discussion group: Indo-Eurasian, research (http: //groups.yahoo.com/group/ 
Indo-Eurasian research/) message 6303 posted on March 19, 2007 by witzel michael (ac- 
cessed on 12.01.2013). 

?? Bronkhorst (2008a p. 475-484) supplies a detailed reply to Witzel on this issue. 

33 See the discussion group: Indo-Eurasian research (http: //groups.yahoo.con/group/ 
Indo-Eurasian research/) message 6311 posted on March 20, 2007 by deshpandem (ac- 
cessed on 12.01.2013). 


58 3 Frameworks 


A joint project, however, is being carried out by École pratique des hautes 
études, Paris; Facolta di Studi Orientali, Universita La Sapienza, Rome; and 
Bhandarkar Oriental Research Institute, Pune to bring out a critical edition 
of the Kasikavrtti.*4 In this context, a notable publication is a critical edition 
together with translation of a section of Kasikavrtti on pratyaharas, which is 
an edited volume by Pascale Haag and Vincenzo Vergiani (2009). Despite 
these ongoing efforts, it would take some years till we have any fully fledged 
critical editions of some important early texts on Paninian grammar. And 
even then, the question whether it may be helpful in resolving any question 
of significance, remains open.” 


The exact number of sütras in the Astadhyayi is not fixed. An approxi- 
mate count is close to 4000. Moreover, as Hartmut Scharfe (2009 p. 33) notes: 
“(W]e have no independent assurance that the division of sütras in our 
traditional text is always the one intended by Panini". Further, changes in 
the boundary of rules may lead to differences in the number and divisions 
of the sütras. Sometimes a sūtra, which is traditionally given as one single 
rule, is divided into two for explaining the formation of certain words 
which otherwise are likely to be stamped as ungrammatical formations. This 
technique is called yogavibhaga and sütrabheda (Abhyankar 1974 p. 318,432). 
Joshi and Roodbergen (1991 p. 20-23), for instance, split the rule sambuddhau 
sakalyasyetavanarse?? into two parts: sambuddhau sakalyasya and itau anarse and 
combine the two rules ufial? and iL?? into one. 


There is, thus, no clarity about whether the verbal formulation of the 
grammatical system is without corruption. Even if one decides to begin with 
the grammatical corpus as laid down in modern editions of Astadhyayr, and 
agrees on a standard version? several major problems still persist which 


34 For more information, see: http://bori.ac.in/manuscriptN department.html (ac- 
cessed on 12.01.2015). 

35 Johannes Bronkhorst (2008a p. 482) remarks: "My expectation is that, even if all Paniniyas 
were to mend their ways and spend their time making critical editions, and even if Witzel 
were to live to see the result, he might not find in (or through) these editions the answers 
he is looking for. To find these answers, other ways may have to be explored." 

36 gel MEARAN 2.2.28 > according to Sakalya o(t) is termed pragrhya if it is sam- 
buddhi and when iti that is not Vedic (arsa) follows. 

37 FA: 2.2.29 ll > according to Sakalya u(ñ) is termed pragrhya when iti that is not Vedic 
(arsa) follows. 

38 H W2.2.2¢ > according to Sakalya anunasika ü replaces u(ñ) when iti that is not Vedic 
(arsa) follows and it is termed pragrhya. 

?? There are different readings in modern editions as well and the task to prepare a stan- 
dard version would be a time intensive exercise. Some efforts in this direction are made by 
Wiebke Petersen and Norbert Endres under the project “Pratyaharas or features? A qualita- 
tive analysis of phonological descriptive techniques—a comparison of Panini's pratyaharas 
and phonological features" (Personal communication). Project website: http: //panini. 
phil.hhu.de/?section-home (accessed: 22.02.2013). 
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have more to do with the nature of the Astadhyayi then the non-availability 
of its critical edition. 


The rules of Astadhyayi are not always stated in an explicit manner 
and require ample interpretation for their comprehension and application. 
The grammatical corpus needs to be supplimented with missing expressions, 
examples, counter examples etc. It is only then that it can be put to use. 
Mere recitation of the grammatical corpus (or in our case, inputting it in the 
computer) does not suffice. Not even resolving the sandhis and identifying 
the individual words. A user must exercise logical interpretations based on 
her or his knowledge of the grammatical system and the meta-linguistic 
conventions employed in the formulation of this system, in order to be able 
to apply it. Patafijali points out this in the following lines: 

It is not that one derives linguistic expressions only through rules of grammar. Then 

how are the linguistic expressions derived? Through rules of grammar together with 

reasoned explanations (vyakhyana). Now, if it is to be said that reasoned explanation 

is nothing but rules separated into constituent parts, then it is not correct, because it 

is not just dissected words such as vrddhih at aic (of a rule like vrddhiradaic*). What, 

then, is reasoned explanation? It consists of examples, counter examples, completion 

of statements by filling the missing words. Reasoned explanation is all this combined 

together.“ 


It follows that one needs to provide mechanisms for incorporating the 
reasoned explanation (vyakhyana) as well, if the formalization is to be put 
to application. This, however, is not exhaustively mentioned in the rules of 
grammar. In other words, the grammatical corpus of Astadhyayi does not 
explicitly incorporate the understanding of its application. Therefore, for the 
purpose of application of grammar, mere rules of Astadhyayi are not enough. 


Another problem which makes the task of formalization of the Pàninian 
grammar difficult is the lack of consistent application of its meta-linguistic 
conventions. This point can be best presented through the detailed study of 
the principles of anuvrtti (carrying over of earlier components to subsequent 
rules) by S. D. Joshi and Saroja Bhate. It shows that although the conventions 
for anuvrtti hold good for a number of rules, there are several counter 
examples. To quote Joshi and Bhate (1984 p. 252): 


The conventions of anuvrtti, which are called rules in a loose sense, are valid in the 
majority of cases. We do not claim that they are valid in each and every case. [...] 
There are counterexamples which go against the conventions of anuvrtti, yet the most 
salient fact about the assumption of these conventions is that they can not be given 
up, even in the face of apparent counterexamples. These contradictory examples do 


2 FRIST 2.2.2 ll > vrddhi stands for at and aic. 

41 q È xp: ws aAA | fà af! eme: cui dd ue Pere areas wala mao 
Tata Area qfi: end d afer! fi afé | Sere scqaresut aaa: afa aacagfad emer wai 
(P. 122-124). 
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not disprove our conventions. Therefore we claim that the apparent counterexamples 
are clearly manifestations of later insertions. 


The reason why Joshi and Bhate do not want to give up the conventions is that 
otherwise the functioning of the grammatical system would be impossible. 
At the same time the presence of counter-examples shows the impediments 
to the process of formalization. On the basis of the inconsistencies in the 
systematic use of the principles of anuvrtti, they propose later insertions of 
rules in the Astadhyayt. In particular, they point out that insertion of the 
Vedic rules and nipatana rules, the rules dealing with samasa and taddhita 
formations leads to the irregularities in the application of the conventions of 
anuvrtti. 


The hypothesis that there must be successive additions is further cor- 
roborated by the presence of "conflicting and incompatible elements in 
different parts of the text". In this regard, the following remark of Joshi and 
Bhate (1984 p. 253) is significant: 
For instance, the same suffix is referred to differently. The instrumental suffix (t)a 
(4.1.2) is reffered to as a(n) in 7.3.105 and 7.3.120. The accusative dual suffix au(t) 
is referred to as au(n) in 7.1.18. The Astadhyayi (for instance, 7.2.9) refers to cer- 
tain elements which are not prescribed in its basic layer. Instead of the genetive 
case, the nominative is used in the samasa-section to indicate that A becomes B (i.e. 
sthanyadesabhava). 


Even if one is ready to ignore or amend the irregularities in consistent applica- 
tion of the meta-level conventions, it does not suffice for a direct formalization 
of the corpus of Astadhyayr. The reason for this lies in the very nature of some 
of these conventions. As an example, one may quote a few of the principles 
specified by Joshi and Bhate in case of anuvrtti: 


Sometimes to have a proper interpretation of the rule the locative case is to be changed 
into the nominative. (Convention no. 18c). Topics or sections need not necessarily 
be introduced by the section-heading rules. They can be understood on the basis of 
Panini’s style of phrasing the rules. (Convention no. 161). If the same section heading 
(adhikara) occurs in different sections, it must have some reason to do so. (Convention 
no. 18k).2 


An examination of the above conventions makes it evident that although 
many of them are conducive for recognizing certain patterns in the formula- 
tion of the rules, they are not sufficient for a formal representation which a 
computer can understand and implement. 


One further handicap is the language of the Astadhyayt itself. The grammat- 
ical corpus is in Sanskrit. This means, the rules of the grammar are applied 
to the language of the grammar as well. Yet they are applied selectively and 
not in a consequent manner. Not all the rules that are applied for formation 


? For the list of these conventions, see (Joshi and Bhate 1984 p. 271-279). 
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of linguistic expressions are applied in the formulation of grammatical 
expressions. In the words of S. D. Joshi and J. A. F. Roodbergen (1991 p. 2): 


Panini’s sütra-language differs from ordinary, literary Sanskrit in this that the rules 
followed in this type of Sanskrit are not necessarily applied to the sütra-language also. 
In literary Sanskrit a samaharadvandva ending in /-c/ takes the samasanta-suffix (t)a(c) 
(5.4.106). For instance, vaktvacam “the aggregate of voice and skin". But this rule is 
not applied to ddaic. An important reason for not applying Panini's own rules to his 
own sütra-language is the concern to keep the expression clear and unambiguous. 


In other words, the rules of Astadhyayi are applied to the language of Astad- 
hyayi only as long as clarity of the grammatical corpus is not undermined. 


Another problem is the loss of meta-linguistic information, especially 
the accentuation and nasalization of the grammatical elements. According to 
the convention svaritenadhikarali^ a circumflex accent (svarita) indicates the 
beginning of a domain (adhikara). But as P. S. Subrahmanyam (1999 p. 182) 
notes: 


The technical use of the circumflex accent was lost in later times along with the gen- 
eral loss of accent in the language. The author of Kasika (7th century A.D.) acknowl- 
edges this loss when he remarks: pratijfiasvaritah panintyah which amounts to saying 
“the circumflex accent is understood to be present only there where the Paninian 
scholars think that it should be present". 


Moreover, the question as to which vowel of an adhikara-sütra is accented is 
also not clear. According to Subrahmanyam (1999 p. 182): 
Patafijali did not touch this problem. But the author of the Kasika and his commen- 


tators (i.e. the authors of the Padamafijari and Nyasa) think that all the vowels of an 
adhikara-sütra have the circumflex accent. 


The same problem is faced once one wants to decide which sounds in a 
component are it-markers. The meta-rule upadese’janunasika it? says that the 
nasal vowels of a component should be an it-marker. But the nasalization 
of vowels in the original instruction was also lost and Kasikavrtti notes 
that “the Paniniyas decide about the nasalization on the basis of established 
agreement on this”. Similarly, accents (udatta, anudatta and svarita) were 
also used to mark the verbal roots. In this case as well, the accentuation 
was lost and later grammarians had to make separate lists of roots carrying 
these accent markers." Meta-rules also do not cover all the eventualities. 
The set of meta-rules specifying the sounds that are it-markers in the original 


43 aAA U14.%.208 ll > the suffix (t)a(c) is introduced after a dvandva compound 
ending in cu, d, s or h provided it expresses collection. 

44 afi ef eR: 2.3.22 ll > through svarita a domain is marked. 

4 TISAI AGATA Sc I 2.3.3 ll > in grammatical instruction, an ac which is anunasika is it. 

46 wfergTS eT: mra: | (KV on 1.3.2). 

47 In the Dhatupatha the listing is sub-categorized under udattah, udatta-itah, anudatta-itah, 
svarita-itah etc. See, for example, (Katre 1989 p. 1173-1200). 
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instruction do not include all the cases.“ Katyayana notes this fact and 
suggests that the group ir added at the end of verbal bases in the Dhatupatha 
should be called it.” Patafijali gives rudh(ir) as an example, and clarifies 
that it is not enough to designate i and r one after another as markers, but 
the entire group ir should be deleted together? It follows that one can not 
depend upon the meta-linguistic rules to decide the it-markers. An explicit 
annotation is necessary. 


The arguments noted above confirm that the strong version of the for- 
malization hypothesis is not tenable. In other words, it would not be possible 
to input the text of Astadhyayi and get as output a formal representation 
which a computer program can understand or implement. This does not 
negate the fact that Astadhyayr is itself an attempt to present the grammatical 
system in a formal manner. Still, it was meant for oral transmission and 
human application?! It would be anachronistic to expect that it should also 
fit the requirements of machines developed some two-and-a-half millenia 
later. Moreover, any such claim would ignore the efforts of both later 
Paniniyas and the works of modern researchers to amend, explain and bring 
consistency in the corpus through several conventions. 


3.2.1 Possibility of formalization of the grammatical system 


The problems faced while attempting to provide a formal representation 
of the corpus of Astadhyayr in a direct manner lead one to look for other 
ways. Here the following differentiations need to be recognized. Firstly, it 
is necessary to differentiate between the system and the framework within 
which this system is articulated. Secondly, it is important to distinguish 
between the oral/ verbal framework in which Astadhyayi is formulated and a 
formal framework which a computer program expects. Finally, it is essential 
to recognize the features of an organizational framework in contradistinction 
to the characteristics of an applicational framework. As these distinctions 


48 The following rules specify it-markers: TIAISAGAIATH gd, ll .3.31l > in grammatical in- 
struction, an ac which is anunasika is it. 857, I 8.3.3 1l > hal coming at the end. 4 fart 
FHT: 12.2.8 ll > but not in vibhakti, the tu, s or m. amfefiged: 2.3.41 > the initial ñi, tu and 
du (of dhatu). 8: XATA 112.2. ll » s of pratyaya. "dg 2.3.9 ll » cu or tu. BAHAI Weve 
> l, é and ku of non taddhita. 

2 V 4on 137. 

50 See (Joshi and Roodbergen 1994 p. 14). Panini refers to entities with ir as it-markers in dl 
aT 13.2.8911 > After irit dhatu replace (c)l[i] through a(n) optionally before parasmaipada 
substitutes of l(u)(n) expressing kartr. 

51 For arguments that the Astadhyayi was developed keeping only oral representational 
apparatus at hand see also (P. 155-163) in (Joshi and Roodbergen 1986 p. 40). 
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are central to my approach for providing a formal representation of the 
Astadhyayi, I examine them briefly. 


The first distinction—namely between the system and the framework— 
corresponds to the difference between the questions as to what is being told 
and how it is being expressed. In the present case the questions would be: 
what is the grammatical system and how is it presented in the grammatical 
corpus. The conventions through which a framework is regulated correspond 
to the meta-linguistic conventions of the Astadhyayi. 


As an example, consider the use of it-markers. In the corpus of Astad- 
hyayi Panini invests specific grammatical characteristics to the constituent 
components by attaching marker sounds to them. The Paninian term for 
these indicators is ^it" (meaning: that which goes away or disappears) and 
the commentators use the term anubandha as well.” The it-markers are not a 
part of the actual form of the components and are deleted unconditionally.” 
Their temporary character is noted by Patafijali who compares them with a 
crow perched on a house: 

If someone asks now, "from among the two houses, which one is the house of De- 

vadatta", one can say "that on which the crow is perching". And later, even if the 


identifying mark of that house is no more, once the crow flies away, the questioner 
recognizes the house.” 


The above remark of Patafijali points out that even after the it-markers are 
deleted, their identifying character is retained or saved by the user in her or 
his memory. Why must they then be deleted at all? Several problems may 
otherwise arise. 


Firstly, marker sounds increase the number of phonemes in a compo- 
nent. This poses a difficulty where, for example, components with only 
one phoneme are allowed. Therefore, a meta-rule is formulated which says 
that while counting the number of phonemes, the markers should not be 
counted.? 


Secondly, markers may influence the final phoneme of a component. 
For example, the root dai(p) in fact ends with ai and not p. It is important 
to delete the marker sound p otherwise the long vowel substitute à can 


52 This term “was chosen for mute significatory letters by ancient grammarians probably 
on account of the analogy of anubandha pasu, tied down at sacrifices to the post and subse- 
quently slaughtered" (Abhyankar 1986 p. 25). Cf., e.g., MB on 1.1.26. 

53 Their unconditional elision is provided by the rule: TA AT: 12.2.9 » its elision (takes 
place). 

54 eR idque VIEW | sme: Fa STU mr: fd | SaR mph ua Wü ae aad wed seda: d Test 
wif I(MB on 1.1.26). Translation with minor variations from (Subrahmanyam 1999 p. 92). 
55 qrgseriqa Select (PS. 6). 
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not replace the final sound ai of the verbal root as prescribed by the rule 
adeca upadese'áit?9. An extra meta-rule is stated to clarify this. It posits that 
while looking at components ending with ec the markers should be ignored.” 


Thirdly, markers also change the form of the components and may cause 
problem while deciding the similarity in form of two components. For 
example, the suffixes (k)a and a(n) have the same form only if one ignores the 
markers. Therefore, an extra meta-rule is mentioned which says that while 
deciding the (dis)similarity of forms, markers should be ignored.?? 


As is evident, these problems are because of the fact that the addition 
of markers changes the form of the components. Grammatical conditions, 
however, take the form of a component without markers into consideration 
and therefore, they must be unconditionally deleted immediately after their 
introduction in the derivational process. Their effect or function, however, 
must be remembered. 


This takes us to the second distinction, namely between the oral and 
formal frameworks. The critical question is how to record or represent the 
grammatical information in a formal framework. The text of Astadhyayt was 
recited and then later put in writing in more than one script. The step which 
is needed for representing oral text in a written corpus needs to be taken 
further to represent it in a formal framework. Continuing with the example, 
the marker sounds which in roman transliterations of the corpus are noted by 
capital letters”, or bold face letters”, or put into brackets“, can be recorded 
as an attribute attached with the corresponding component. 


In the new framework, it-markers are represented as attributes attached to 
the form of corresponding components. Instead of mentioning it at the form 
level and thus mixing it with the form of the constituent components, they are 
placed at the content level together with other attributes.? For example, the 
markers n and k attached with the phonetic form of the suffixes a(n) and (k)a 
are not stored at the form level but at the content level as attributes nit and kit. 


Separating the two levels and maintaining this distinction in the repre- 


~ > oa 


56 Ste SAGRISHAT 1.2.8411 > af replaces ec occuring at the end of verbal roots enunciated 
in the original recitation, except when a suffix with $ as it-marker follows. 

57 AGA Basal (PS. 7). 

58 qrqseqaae SAE (PS. 8). This decision is important for application of the suffixe (k)a 
as an exception to the suffix a(n). See (Subrahmanyam 1999 p. 187). 

5 Sumitra M. Katre's edition of the Astadhyayi (1989) uses this convention as also the vol- 
umes by S. D. Joshi and J. A. F. Roodbergen (1975) etc. 

6° George Cardona (1997) follows this in his edition. 

61 This is used by P. S. Subrahmanyam (1999). 

€ See section 2.2.1 for the form and content levels. 


3.2 Formalization of the Paninian corpus: challenges and possibilities 65 


sentation of grammar has several advantages. The meta-rules mentioned 
in (PS. 6-8) are now redundant. The Paninian rule tasya lopali? becomes 
redundant as well. This is because the marker attribute now being at the 
content level does not change the form of the component. Moreover, there 
are instances when a component with a marker looses its marker and is 
used without it. In other words, sometimes a particular marker is removed 
although originally it constitutes a part of the component. Thus by the rule 
na ktva set the it-marker k is removed. This is easily executed by removing 
the corresponding attribute kit from the component. 


Further, the new representation also makes it possible to take care of 
situations where the characteristic of having a marker is carried over to 
a component, although the marker sound is not expressedly attached 
to it. For example, the rule sarvadhatukamapit® says that a sarvadhatuka 
suffix which is not pit (or not marked with p) is as if nit (or marked with 
n). This amounts to attaching the attribute nit to the appropriate component. 


At this point one may ask whether I am suggesting a new jargon to 
note down the Astadhyayr?? This is not the case for the following reasons: 
firstly, it is not that I am only providing new transliteration conventions or 
merely inputting the corpus into the computer. Such efforts provide for edi- 
tions which only human readers can read, interpret and apply. On the other 
hand, a formal representation renders the grammar in such a manner that an 
algorithm can read, interpret and apply it. Secondly, in order to achieve this, 
the applicational aspects of the grammar need to be incorporated with the 
organizational concerns. Astadhyayr is formulated with the aim of optimal 
organization of the grammatical content and processes. Its application is left 
to the user. This is natural as the prime concern was to record the information 
and since the system is learnt by a student, its application is not explicitly 
stated. 


However, it does not mean that Astadhyayi is oblivious to application. 
The rules are specified in an operational set-up, where the derivational 
process is carried out. The rules of Tripadi, or last three sections of the final 
chapter of Astadhyayi, have a clear procedural thrust as well. 


It is much later in the 14th and 15th centuries CE that the task of reor- 
ganization of the Astadhyayi with procedural application (prakriya) as its 
main focus was taken up. The two most important texts which attempt this 
are the Prakriyakaumudi of Ramacandra (late 14th-15th cent.) followed by 


63 qe SIA: 2.3.3.1 » its elision (takes place). 

64 eara WLR. p» (k)tva together with i(t) looses kit. 

65 adagan 12.3.21! » sarvadhatuka pratyaya, if not pit, are assigned nit. 

6 In the Astadhyaàyi the rules 1.2.1 to 1.2.26 provide for the assignment of it-attributes. 
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the Siddhantakaumudi of Bhattoji Diksita (late 16th-17th cent.). Both of them 
rearrange the sütras of the Astadhyayt for the purpose of derivation of forms.” 


The approaches of grammarians like Ramacandra and Bhattoji Diksita 
however are not without problems. Paninian sütras make sense only in 
their original context. A sūtra is not an independent unit and for the sake 
of application several inter-connected sitras need to be taken into account. 
A reorganization disturbs the original contextual location of the sütras. 
This takes us to the third distinction, namely, how some information is 
organized, i.e. the organizational framework and how it can be put to use, 
i.e. the applicational framework. A formal representation takes care of both 
aspects and records them in an explicit and non-ambiguous manner. This 
facilitates an implementation through a computer program. Keeping the 
above distinctions in mind, I propose the following hypothesis. 


Weak version of the formalization hypothesis: The Astadhyayi in its 
current formulation cannot be formalized directly but the grammatical 
structure which it encapsulates can be reformulated in a formal framework. 
The formalized reformulation can then serve as an input to a computer 
program that can interpret and implement it. 


3.3 Specification of the formal framework 


The discussions in the previous sections show that a formal framework needs 
to be developed in order to represent the content and processes of Astadhyayt 
if one wants logical systems to interpret and apply it. An important feature 
of such a framework has to be a clear distinction and separation between the 
content of the grammar and the manner in which it is formulated. In other 
words, the framework should be independent of the content of grammar. 
It should only facilitate the representation of the structures and the system, 
without in any way influencing them. 


There are two basic concepts upon which the present framework is based. 
The first concept is that any given whole can be analyzed into components. The 
second is the idea that any given entity can be described through a number 
of characterizing attributes. 


While proposing the manner in which the grammatical elements—namely the 
components, attributes and meaning-expressions—should be represented 


67 Compared to the earlier attempt of Ramacandra, Bhattoji Diksita covers all rules of the 
Astadhyayi and follows them consequently. See (Houben 2008 p. 563-574). 
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in the new framework, the way in which they are employed during the 
process of derivation is also taken into account. Similarly, the simulation of 
the derivational process takes into consideration the manner in which the 
grammatical elements are represented. The focus, therefore, is to integrate 
the organizational and applicational aspects of grammar. 


As discussed in section 2.3 the derivational process demands a mecha- 
nism through which the changing forms and contents of the grammatical 
elements can be tracked in a convenient manner. Equally important is to keep 
track of the process of derivation, i.e. the previous stages attained during the 
process, and record them in a manner so that they can be consulted when 
deciding the future steps. Moreover, central to the Pàninian process is the 
interaction with the user. While specifying the formal framework, these 
general guidelines need to be taken into consideration. 


The new framework which I put forward is designed to represent the 
categories and processes outlined in the previous chapter about the gram- 
matical system. Three kinds of basic elements are proposed: 


1. The constituent components p; of the language. 


2. The diverse attributes a; which are either explicitly mentioned or some- 
times implicitly used in the grammar. 


3. The copious meaning-expressions x7” that encode a multitude of infor- 


mation which is external to the grammatical system. 


While attributes and meaning-expressions work at the content-level, the form 
of a component is of utmost relevance for the derivation of the linguistic ex- 
pressions. This is depicted in the following diagram. 
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There are two fundamental processes in this formulation. 


1. Specification of the components. A component is characterized by asso- 
ciating the information from the content level. Thus, some meaning can 
be established through usage p?", or some grammatical attribute ay may 
be attached to it, or some meaning-expression z7' may be associated with 
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it. Further, some specific form—e.g. presence of some phonetic feature— 
characterizes a component as well. 


2. Combination of the components. This involves introduction of new com- 
ponents in the derivational process. 


3.3.1 Representing a component 


The first task is to specify how the constituent components can be represented. 
For this purpose it is necessary to look into how these are formulated within 
the corpus of Astadhyayi and whether the new representation adequately 
incorporates all the aspects of Paninian formulations. 


In section 2.2.1 we have seen that the constituent components p; of a 
linguistic expression x consist of two levels, namely the form and the content 
levels. At the form level these are clearly made up of a sequence of sound 
units. An obvious manner to represent them would be a sequence or list. For 
example, the component bhü can be seen as a sequence of two sounds bh and 
ü which is represented as 


[bh, à] 


Astadhyayi contains "the first enunciation” (upadesa) of the constituent com- 
ponents of Sanskrit.? Looking from the perspective of the form level one can 
speak of two distinct parts in the Astadhyayr according to the two fundamen- 
tal types of the constituent components—the basic sounds and secondly their 
sequences. 


1. In the beginning of the Astadhyay the first part enumerates the funda- 
mental sounds. These listings are called the Sivasütras. 


2. The rest of the Astadhyayi enunciates components of the language which 
are a sequence of one or more basic sounds. These collections are called 
the Sütrapatha, the Dhatupatha and the Ganapatha. 


The listing of vowels in the Sivasütras corresponds to a group of sounds and 
each one of them is further specified by additional attributes. For example, 
the sound a stands for several variations of this fundamental sound, namely 
short (hrasva) /a/, long (dirgha) /a/ etc. If one represents a sound through 
a set, then the general sound /a/ would be represented by {a} and the long 
vowel /a/ by {a, dirgha}. In other words, adding more attributes to the set 


68 The word upadesa refers to the original or first instruction of the grammatical compo- 
nents in the corpus before it enters and changes its form through the derivational process 
(prakriya). For a detailed study on the concept of upadesa in Sanskrit grammar see (Biswal 
1996). 
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which represents a sound would result in further restricting and specifying 
a particular sound. Thus, (a, dirgha, udatta, anunàsika] denotes a particular, 
more specific instance of the vowel /a/. 


A sound can be represented through a set consisting of a fundamental- 
sound and a number of other characterizing attributes. It should be 
noted that within a set representing any sound, there can be exactly one 
fundamental-sound. In the following, I will call such a set a sound-set. 


In the above example, the fundamental sound is /a/ and the characterizing at- 
tributes are dirgha, udatta etc. The Sivasütras provide the set of fundamental- 
sounds. 


la, i, u r, |, e, o, ai, au, h, y, v, r, 1, fi, m, n, n, n, jh, bh, gh, dh, dh, j, b, g, d, d, 
kh, ph, ch, th, th, c t, t, k, p, $$, s, hj 
The form of any constituent component of a linguistic expression can now be 


represented as a sequence of sound-sets. Consider again the example of the 
component bhi. It can be represented as a sequence of two sound-sets: 


bhū = [{bh}, {u, dirgha, udatta}] 


This representation allows one to include any number of attributes that may 
characterize the sounds. The attributes that are associated with a sound in- 
clude phonetic and phonological features.” Thus, one can include other at- 
tributes e.g. the attribute vowel for which the Paninian term is ac. 


bhū = [{bh}, {u, dirgha, udatta, ac}] 


In this manner the information which is specific to a particular sound can be 
incorporated in the corresponding sound-set. 


Some information, however, is shared by more than one sounds. For 
example, the attribute dhatu (verbal root) is valid for both sounds of this 
component. Accordingly, this attribute can be included in both sound-sets. 


bhū = [{bh, dhatu}, {u, ac, dirgha, udatta, dhatu}] 


The sound-sets in a sequence share those attributes that are common to them. 
The presence of the attribute dhatu in both of these sets indicates that both of 
them in this sequence are part of dhatu or a verbal root. 


In the above sequence of sound-sets, thus far, there is no reference to 


© For an introduction to feature systems see (Clark, Yallop and Fletcher 2007 p. 372-390). 
James Stanton Bare (1980) has studied the system of features implicit in the Astadhyayi. 
Interpretations of the Paniniyas on the question of nearness (antaratamya) is discussed by 
Robert A. Hueckstedt (1995). 
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bhū, the original component which is being represented here. This informa- 
tion can be included by adding bhü to both the sound-sets that form a part of 
this component. 


X =[{bh, dhatu, bhū}, {u, ac, dirgha, udatta, dhatu, bhü]] 


The linguistic component can now be summarised as X, and all the informa- 
tion can be gathered by the sound-sets that are contained in it.”” 


The above example leads us to specify a general representation for the 
components. 


X= [(sdi, Q1, 02, eel {sd2, Q1, Q17, Q43, s {sdi, ak, deal AA 


Here, X is a sequence of sets. Each set consists of exactly one key from the 
collection of fundamental sounds. Thus, the first set has sd, the second sd» 
etc. Further, each sound-set may have a number of attributes. Thus the first 
sound-set has attributes a4 and a» etc. It should be noted that the unique 
fundamental sounds in each sound-set represent the basic phonetic form of 
that unit, and the attributes comprehend the characteristics shared commonly 
among several sound-sets or which are individual to a particular sound-set. 
For example, the attribute a4 is common to the first two sound-sets and a is 
unique to the first sound-set.”! 


Within the new framework, each element, ie. the constituent compo- 
nents, attributes or meaning-expressions are assigned a unique identifier or 
an ID. There is an ID 


e for every sound, like /a/ (ID: a 0), /i/ (ID: i 0), /u/ (ID: u 0) etc. 


for the phonetic form of every component: a (ID: a. 2), bhū (ID: bhU a), 
(8)a(p) (ID: zap. 0), ti(p) (ID: tip. 0) etc. 


e for the attributes: vrddhi (ID: vRddhi, 0), dhàtu (ID: dhAtu, 0) etc. 
* for meaning-expressions: vartamüna (ID: vartamAna_x), and 
* for lexical content: balaka (ID: bAlaka x). 


Assignment of an ID ensures a non-ambiguous identification of the elements 
of grammar. For example, the substitute suffix a (ID: a. 2) is distinguished 
from the phoneme /a/ (ID: a 0) on the basis of their IDs. However, for the 
sake of readability, I will not display the underlying IDs. 


70 There are a number of other attributes that are needed to be included in order to have 
complete information about the component bhi, e.g., that it is part of the first group of 
verbal roots etc. 

7! Thus far, only one constraint is mentioned—namely the presence of exactly one of the 
fundamental sounds in each of the sound-sets. In due course, other constraints will be men- 
tioned that are imposed by the grammar to specify particular components. 
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The components of any linguistic expression specified in the Paninian corpus 
can be represented in terms of a sequence of sound-sets. Each of the sound- 
sets in such a sequence consists of exactly one ID from the set of fundamental- 
sounds. Further, it contains a number of other IDs corresponding to other at- 
tributes and meaning-expressions. Moreover, if this sound-set is part of some 
specific component, then this information is also incorporated by including 
the ID of that component with it. Within the new framework, such sequences 
of sound-sets are termed as language-components and can be defined as fol- 
lows. 


A language-component corresponds to any component of grammar and 
is a sequence of one or more sound-sets. 


One of the features of the grammatical system specified in the previous chap- 
ter was the attachment of the attribute to some component. 


bi 


ak 


In the above diagram, the attribute a; is attached to the form of the compo- 
nents pf. The present framework implements the attachment of the attributes 
in the following manner. If the form of some component has the sounds sdi;, 
sd» etc., then the attributes that are common to them are attached with each 
of the constituting sounds and are included in the respective sound-sets. 


pf: sd, Sd» sdi 


a 


ak 


The lexical meaning p?" as well as the meaning-expressions z7' associated 
with some component are recorded in a similar manner. Thus, if a lexical 
meaning is connected with some component, then it is connected to each 
sound-set of that component. 


In the following section I will provide examples to demonstrate that the 
components specified in the grammatical corpus of Astadhyayi can be 
adequately represented in the new framework. Moreover, I will also point 
out how the Paninian articulation differs from the new one, but at the same 
time all the information needed for grammatical processing is aslo included 
in my representation. 
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In the Paninian formulation the fundamental sounds are represented 
directly. The new framework stores them through a sound-set with a unique 
identifier. Moreover, the phonetic attributes are stored explicitly through 
attribute IDs. 


1. The sound /a/ in the Astadhyayi will be equivalent to (a 0]. The long 
variation à is equivalent to (a, 0, dIrgha 0]. Panini attaches the marker 
sound /t/ to restrict the time duration.” Thus Paninian at is equivalent 
to [a 0, dIrgha 0] and at is equivalent to [a 0, hrasva 0]. One can also 
record the Paninian attribute for short or long /a/, namely at or at by 
including the IDs at. 10r At_2 to the corresponding sound-set. Thus (a, 0, 
dIrgha O0, At 2] incorporates the information that the sound is called at 
by Panini. Inclusion of this information is redundant to some extent, but I 
include it to record the Paninian term and to be able to correlate with the 
corpus of the Astadhyayi. 


2. Sounds like the nasal sound m or the aspirated sound h are not listed 
explicitly in the Sivasütra but are referred to in the grammar by their 
names: anusvara for m and visarjaniya for h. The new formulation in- 
cludes the surface form of the sounds named through anusvara or vis- 
arjaniya. Thus, the corresponding sound-sets are (M 1, anusvAra 0] and 
(H 1, visarjanlya 0]. 


3. In the Paninian formulation the constituent components can have addi- 
tional indicatory sounds, namely the it-markers. These are attached either 
at the beginning or at the end of the base form of a component. For exam- 
ple, the suffix ti(p) consists of the sound p as marker at the end. Similarly, 
the infix ($)a(p) has a marker sound $ in the beginning and p at the end. 
Sometimes more than one sound is added at the end or in the beginning, 
for example, yuj(ir) or (du)kri(fi).? In the new framework, the markers are 
included as an attribute. For example: 


ti(p) = [(t, pit}, (i, hrasva, pit}] 
(8)a(p) = [{a, hrasva, sit, pit}] 
yuj(ir) = [{y, irit], {u, hrasva, irit], {j, irit}] 
Thus, ti(p) is a component with the attribute pit i.e. “having p as it". Sim- 
ilarly, ($)a(p) is Sit and pit and yuj(ir) is irit. The sign = refers to partial 
equivalence of the Paninian component in the new formal framework. 


72 ARARSA 62.2.99 Il > an ac followed by the marker t stands for sounds having the same 
time duration. 

73 Vidya Niwas Misra (1966) provides a list of these markers and the “functional load" asso- 
ciated with them. P. S. Subrahmanyam (1999 p. 92-108) uses the term indicators and notes 
them as well. All such marker sounds are represented by ( ) brackets in our writing conven- 
tion. 
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4. Within the Paninian system, a sound can disappear, i.e. it can cease to be 
manifest at the audible level or the form level. The rule adarganam lopali/* 
states that the result of the attribute lopa is “non-appearance” (adaráana) 
of a sound. The same is the case with luk, $lu and lup which are attached 
only to suffixes.” The presence of attributes like lopa, luk, álu and lup 
would imply that the corresponding sound-set is invisible or mute at the 
form level. 


5. Itis sometimes difficult to pronounce the constituent components as they 
consist only of consonant clusters without a vowel. In such cases, a vowel 
is added for facilitating pronunciation. This is necessary for an oral cor- 
pus. For example, the augmentation n[u](m) where the vowel u is just for 
the sake of pronunciation. In the printed edition, these are noted within 
square brackets [ ]. Within the new framework, this information is redun- 
dant although it is stored for the sake of exact reproduction of the compo- 
nents in the original corpus. 


Within the new framework, a component of any linguistic expression is rep- 
resented in terms of a language-component, which is modeled as a sequence 
of sound-sets. These components can be seen as units of an inflected word. 


For example, the word pathati has components path(a), ($)a(p), ti(p) etc. 
Although the derivational manuals like Prakriyakaumudi and Siddhanta- 
kaumudi provide for derivation of individual words, the Paninian system 
of Sanskrit grammar and especially its process of derivation functions not 
at the level of individual words, but at the level of an entire sentence. In the 
new framework, this aspect is taken care of by introducing the concept of a 
sentence and defining it as follows. 


A sentence is a sequence of one or more language-components. 


At this point it is important to mention that the above definition of a sentence 
refers to the collection of language-components within the new framework. It 
should not be confused with the linguistic definition of a sentence and should 
only be viewed as a sort of container in which the language-components can 
be stored in a sequential manner. 


74 agi I: 1.2.40 ll > non-appearance is termed lopa. 
75 FORA oS: 2.2.82 > non-appearance of a suffix is termed luk, élu and lup. 
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3.3.2 Comprehending the dynamics of the derivational process 


Thus far, a framework for representation of the Paninian components has been 
introduced. The grammatical corpus not only provides the constituents of a 
linguistic expression but also the processes to combine them. This synthe- 
sis takes place in a number of steps. Within the traditional Paninian frame- 
work this process is carried out by a person who takes decisions regarding the 
choices of introducing the new components. As mentioned earlier, the deriva- 
tional process is guided by 


1. the intention (vivaksa) of the speaker, and 
2. the constraints of the grammatical system, which involves 
a. consistency of the derivational state 
b. its saturation, and 
c. its completion. 
The new framework takes care of both the human and the grammatical 


aspects of the derivational processes. 


Consider the example of derivation of the Sanskrit expression balakali 
pathati (a boy reads) worked out in the previous chapter. The first step is the 
introduction of the constituent components bàlaka and path(a). Their repre- 
sentation in the new framework would be in terms of a sentence consisting 
of two language-components: if X1 is the language-component for balaka 
and X2 for path(a) then depending upon their mutual order, the sentence 8 is 
given by a sequence of X1 and X2. 


S = [x1, x2] 

Further, each language-component is a sequence of sound-sets: 

e X1 =[{b},{a, dirgha}, {l}, (a, hrasval}, {k}, (a, hrasva}] 

e X2=[{p}{a, hrasva}, {th}] 
For better readability, I note the language-components with their respective 
sound-sets in the following manner: 
S = [X1, x2] 
X1 ss1 fb, balaka} 

ss2 (a, dirgha, balaka} 

ss3 {l, balaka} 
{ 


ss4 {a, hrasva, balaka} 
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k, balaka] 


ssb { 

ss6 (a, hrasva, balaka} 

X2 ss1 fp, path(a), ait, udattet} 
ss2 (a, dirgha, path(a), ait, udattet} 
ss3 (th, path(a), ait, udattet} 


The sound-sets consist of a number of IDs. As mentioned earlier, an im- 
portant constraint is that each sound-set must contain exactly one ID from 
among the set of fundamental sounds. For example, a sound-set like (a, i} is 
invalid. One can say that the collection of fundamental sounds forms a set of 
mutually exclusive IDs. Another example of a set of mutually exclusive IDs 
is the set {hrasva, dirgha, pluta}. Within a particular sound-set the presence 
of an ID from this set excludes the possibility of inclusion of any of its other 
IDs. One can introduce the concept of consistency of a sound-set. One such 
condition of consistency would be that a sound-set should not have more 
than one ID from the set of mutually exclusive IDs. 


There is a separation of the form and the content level. The language- 
component X2 represents path(a). In the Pàninian formulation it has four 
sounds / p a th (a)/ where the last one is only a marker, which is uncondi- 
tionally deleted by the rule tasya lopah”®. In the new formulation, the marker 
sound is not represented through an extra sound-set but is included as an 
attribute (ait) at the content level. Thus, only as many sound-sets are formed 
as there are sounds that contribute to the form of the linguistic expression. 


Attachment of the attribute dhatu. The rule bhavadayo dhatavah says 
that the components mentioned in the list beginning with bhü—i.e. the 
list of verbal roots or Dhatupatha—be assigned the attribute dhatu. In the 
new framework this assignment is carried out in the following manner: the 
sound-sets of the language-components are checked and if an ID which is also 
part of the set of the verbal roots is present, then that sound-set is attached the 
ID corresponding to the attribute dhatu. This can be formulated as follows: 
given a language-component Xi of some sentence S, if it contains one of the 
IDs belonging to the set (bhà, ..., path(a), ...}, then attach the attribute dhatu. 


S = [X1, x2] 
XT. 
X2 ssi fp, path(a), ait, udattet, dhatu} 
ss2 (a, dirgha, path(a), ait, udattet, dhatu} 


76 TAAT: 12.2.8 1 p its elision (takes place). 
77 FATA WI: 12.2.2 ll > components in the list beginning with bhū are dhatu. 
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ss3 (th, path(a), ait, udattet, dhatu} 


Here it should be noted, that in the new framework there is no need to 
introduce extra indices for different constituents as the IDs of components, 
attributes and meaning-expressions provide the requisite identification. So 
in this case, each sound-set of the language-components will be checked for 
whether it has some ID (in this case path(a)) belonging to the set of the IDs 
of dhatus, and if this is the case then the attribute dhatu is added to that 
sound-set. 


Similarly, attachment of other attributes e.g. pratipadika, kartr, pratyaya, 
vrddhi etc. would bring about inclusion of the attribute IDs to the appropriate 
sound-sets. The selection of the appropriate sound-sets would depend upon 
examination of different conditions. Thus, attachment of an attribute results 
in addition of the IDs to one or more sound-sets. 


Adding a new component l(a)(t. Addition of a new component involves 
extension of the sentence by adding new language-components. The rule var- 
tamane lat? introduces l(a)(t) if present tense is intended. The derivational 
state looks as follows: 
S = [X1, X2, X3] 
X1 ... 
X2 ssi fp, path(a), ait, udattet, sakarmaka, dhatu} 

ss2 (a, dirgha, path(a), ait, udattet, sakarmaka, dhatu} 

ss3 (th, path(a), ait, udattet, sakarmaka, dhatu} 
X3 ssi {l, l(a)(t), tit} 


Here, a new language-component X3 is added after X2. For this, the indices 
of the language-component with dhatu need to be identified and the new 
component should then be introduced after it. 


Replacement of l(a(t) by ti(p). Replacement of a component by an- 
other can now be represented as a combination of addition and attribute 
attachment. Thus, in order to replace l(a)(t) by ti(p), the language-component 
X4 corresponding to ti(p) is added after the language-component X3, which 
corresponds to l(a)(t). Further, a new attribute ó (denoting "replaced") is 
attached to X3. The derivational state looks as follows: 


S = [X1, X2, X3, x4] 
Xi... 


78 ada BE 13.3.?311l > to express present time, introduce l(a)(t). 
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X2 ... 
X3 ssi {l, l(a)(t), tit, abhihita-kartr, 6} 
X4 ssi ft, ti(p), pit, prathama, ekavacana} 
ss2 (i, hrasva, ti(p), pit, prathama, ekavacana} 


It should be noted that the component l(a)(t) which has been replaced is not 
dropped from the sentence. Instead, a new attribute 6 is attached to it. In 
this manner the information which it carries remains available for all further 
steps. In the traditional framework, owing to its oral and linear nature, 
the substituted component needs to be removed and in its place, the new 
replacement is placed. 


The process of derivation progresses through a number of derivational 
states. A particular derivational state represents the effect of some gram- 
matical operation on the current sentence. The grammatical operations are 
enjoined by the operational rules of grammar. Within the new framework, 
the operational rules of Astadhyayi are modeled as statements (see next 
chapter). 


A derivational state, therefore, stores the effect of a statement when 
applied on a sentence. 


The above example indicates that from the perspective of the new framework, 
there are two basic types of operations which the operational rules enjoin: 


1. Attachment of an attribute to some language-component or a sound-set. 
2. Addition of a component to some language-component. 


The first category corresponds to the process of saturation of a derivational 
state and the second one for its completion. A number of saturating state- 
ments are applied giving rise to a sequence of derivational states. A container 
is required to collect these derivational states. Such a sequence of derivational 
states is termed a slice. Thus, a slice is defined as follows. 


Slice is a sequence of derivational states. 


Attributes are attached as long as the level of saturation is not reached. Once 
no more attribute can be attached to a particular derivational state, then the 
current slice attains saturation. Then, depending upon the conditions, a new 
component can be added which prompts an introduction of a new slice. As 
long as new components can be introduced, the process remains incomplete. 
Again, a container is required to collect the slices. Once no more component 
can be introduced it attains completion. 
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The sequence of slices is collected in a process-strip. Thus, a process-strip 
is defined as follows. 


Process-strip is a sequence of slices. 


To sum up, the entire derivational process is modeled through a 
process-strip which is a sequence of slices. Whenever a new com- 
ponent is added, a new slice is introduced. Within a slice, there is a sequence 
of derivational states. Each derivational state corresponds to the attach- 
ment of some attribute. Finallly, a derivational state stores the effect of some 
operational statement on a sentence. Further, a sentence is a sequence of 
language-components which in turn consists of a sequence of sound-sets. 


Chapter 4 


Representations 


In the previous chapter I proposed a new framework and it is time now to rep- 
resent the operational rules of Astadhyayr in this new framework. In contrast 
to the sütras of Astadhyayi which are meant to be interpreted and applied by 
a learned student of grammar, the operational rules are formulated in terms 
of statements. In the following, I will take a representative selection of the 
operational rules of Astadhyayi and render them in the new framework. The 
task to represent all the operational sitras of Astadhyayi would require more 
time and space than I have at my command at this moment. My claim, how- 
ever, is that the operational rules of the entire Astadhyayi can be represented 
in this manner. 


4.1 Operational statements 
4.1.1 Characterizing the components 


Consider the very first sūtra of Astadhyayr: vrddhiradaic. It says that at and 
aic are termed vrddhi. Traditionally, it is a definition (safijfia-sütra) specifying 
the technical term vrddhi. In the new framework it is seen as an operational 
statement that provides for attachment of the attribute vrddhi to those sounds 
that contain the attribute āt or aic. It is specified in the following manner. 
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*a: vrddhi 
1. Xm+a || Xm[&t, aic] 


aS 


JST 12.2.2 ll » vrddhi stands for at and aic. 


The above statement consists of three lines. The first one is the header 
where the variable *a represents the type of operation, namely attachment 
of an attribute. This is followed by a semi-colon : and then the value i.e. the 
ID of the attribute which is to be attached. The attribute whose attachment 
is ordained by this rule is vrddhi. So the first line says that this statement 
provides for attachment of the attribute vrddhi. 


The second line specifies that the attribute a is meant to be attached to 
a sound-set Xm. This is noted as Xm*a. Here, Xm is the m-th sound-set of some 
language-component X. This is followed by two parallel vertical lines || which 
separate the actual listing of the conditions. The condition in this case is noted 
as Xm [at, aic]. It means that the sound-set Xm should have either the ID: at or 
the ID: aic. The second line is numbered as 1. The need for numbering will 
become clear once we meet cases where more than one mutually exclusive 
condition may trigger the operation. The core of the statement consists of the 
first two lines. It can be read as follows: attach an attribute with value vrddhi. 
The attachment is to a sound-set. This sound-set must contain either the ID: 
at or the ID: aic. 


The third line which follows is useful for quick reference. It contains 
the original Pàninian sütra which corresponds to the above statement to- 
gether with an English translation. 


The above statement is formulated within the formal framework intro- 
duced in the previous chapter. Thus, it provides for attachment of an 
attribute to a sound-set. Moreover, the condition which needs to be fulfilled 
is also specified in terms of the presence of certain IDs in that sound-set. 


Two separate layers can be identifed in the above statement. Firstly, its 
structure or form and secondly its values or content. The structure part is noted 
in the typewriter font. It is: 


*a:— 


Xm*a || Xn [—] 


The content or values are the attributes vrddhi, at and aic. Thus, there is a 
clear separation between the form or structure of a statement on the one hand 
and its content or values on the other hand. 


There are certain conventions employed in the new formulation. Some 
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of them correspond to the logical or structural properties of the statement 
and others are my choices for displaying them in print form. I collect and list 
these conventions in the grey boxes. 


1. X stands for a language-component. 

2. Xm stands for the m-th sound-set of some language-component X. 

3. +a stands for the type of operation, namely attachment of an at- 
tribute. 

4. Xm*a says that the attribute a is attached to the sound-set Xm. 

5. Xn[ID1, ID2] represents the condition that the sound-set Xm should 
have either ID1 or ID2. 


The next sūtra of Astadhyayr is similar. In the grammatical corpus it is stated 
as adengunah. It says that at and ert sounds are called guna. Its representation 
in terms of the new framework will be as follows: 


+a: guna 
1. Xmta || Xm[at, eri] 
weg: 8.8.21 > at and eri are guna. 


This statement can be read as follows: attach an attribute with value 
guna. The attachment is to a sound-set. This sound-set must contain either 
the ID: at or the ID: eri. 


Now, if one compares the structure of this statement with the previous 
one, it becomes clear at once that both the statements have the following 
identical structure. Only the values of the IDs have changed. 


+a:— 
Xm+a || Xm [—] 


The identical structure of the above statements tells us that from operational 
point of view, they are identical. In other words, both the statements can be 
implemented through a single operational function. In fact, the operation of 
attachment of an attribute to some component is a fundamental operation in 
the Paninian system. Within the Paninian corpus this operation is formulated 
and presented through different techniques. We list them in the following sec- 
tions. 
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4.1.1.1 Explicit enumeration 


A straight forward way to form a set is to enumerate its members explicitly. 
Panini uses this technique extensively, where the components are listed 
together. As an example, consider the statement : 


+a: nistha 
1. X*a || X[(k)ta, (k)tavat(u)] 
arad FAST 2.2.28 Il > (K)ta and (K)tavat(u) are nistha. 


Here the components (k)ta and (k)tavat(u) which form the set called 
nistha are mentioned explicitly. Again, the structure of this statement, 
namely 


*a:— 
X*a || XL—1 


is similar to the strucutre of statements attaching the attributes vrddhi or guna, 
namely: 


+a:— 
Xm+a || Xm [—] 


There is, however, one important difference. This statement provides for 
attachment of the attribute nistha to the entire language-component, while 
the previous statements provide for attachment of attributes like vrddhi 
or guna to a particular sound-set. Further, there are significant differences 
in the nature of values. While all the statements provide for attachment of 
some attribute, the conditions in the first two statements are formulated 
in terms of other attributes (like at or aic), while in the case of the third 
statement, it is based on other fundamental components, namely (k)ta and 
(k)tavat(u). One can, at this point, speak of attributes that are attached to 
a sound-set or phonological features and those that are associated with a 
language-component or morphological characteristics. 


Another example of attribute attachment to a language-component is: 


+a: gha 
1. X+a || X[tara(p), tama(p)] 
ACTA T: WAM > tara(p) and tama(p) are gha. 


Apart from explicit enumeration of memebers of a set within the main 
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corpus, namely the Sütrapatha of Astadhyayi, there are extra lists or sets 
of components collected in the two large appendices—the Dhatupatha and 
the Ganapatha. Within these large lists there are several sub-groups as well. 
One common way of naming such a list is by mentioning its first element 
like bhū and then forming the compound beginning with bhū or bhvadi. The 
corresponding statement would be: 


*a: bhvadi 
1. X+a || X[bhü, path(a), ..., ji, pa), dhe(t), ...] 


The form of this statement is identical to the one for attachment of the attribute 
nistha. It says that whenever a language-component has any of the IDs for 
bhü, path(a) etc., then the attribute bhvadi should be attached to it. All such 
sets that are specified in the grammatical corpus by explicit enumeration of 
their elements can be formulated within the new framework in a similar man- 
ner. For example, the statement for the next large set of verbal roots would be: 


+a: adadi 
1. X*a || X[ad(a), han(a), dvis(a), duh(a), dih(a), ...] 


In both cases above, i.e. attachment of bhvadi or adadi, there is no mention of 
any sūtra of Astadhyayr. This is because these statements do not correspond to 
any sūtra which is uttered explicitly in the grammatical corpus. Thus, the new 
formulation includes information that is specified in the grammatical corpus 
even if it is not mentioned in an explicit sūtra of the Astadhyayi. 


4.1.1.2 Formation of sigla (pratyahara) 


The fundamental sounds are listed in what are called the Sivasütras.! 
Compared to the listings of the sounds in Pratisakhyas, there are two novel 
features in Pànini's enumeration. Firstly, there is a new sequencing of the 
sounds and secondly, there are delemiting markers placed between them. 
Following is the list of sounds in the Sivasütras : 


aium|rlkjeonm|aiauc|hyvrft|l1g| &mnnn m|jh bh ñ| gh dh 
dh s|jbgdds|khphchththcttv| kpy|$ssr|hiIl 


1 The terms used for the list of units or phonemes are aksarasamamnaya (in Mahabhasya) 
and Sivasütra or Maheévarasütra in later tradition. Nandike$vara mentions in his Kasika 
that the lord Siva sounded his drum (dhakka) fourteen times revealing the fourteen sütras to 
Panini. Hence the name Sivasütra or Mahesvarasiitra. See (Cardona 1997 p. 83). Important 
studies on the Sivasütras include (Breloer 1935 p. 133-191), (Cardona 1969 p. 3-48), (Staal 
1962 p. 1-10), (Kiparsky 1991a) and (Petersen 2004 p. 471-489 and 2009 p. 79-98). 


84 4 Representations 


This sequence is not according to the usual order found in the Pratisakhyas 
on the basis of place of articulation? Moreover, certain consonants are 
inserted as delimiters. They are placed at particular positions indicating the 
final sound of a sütra.? Using delimiters, the groups of sounds can be named 
in a generic manner. For this, one begins from the initial non-delimiter sound 
and continues upto the final delimiter sound. All the non-delimiter sounds 
contained in this sequence belong to that group.* 


The purpose of the special ordering as well as use of delimiters is to 
form groups and sub-sequences required for the functioning of grammar? 
Katyayana notes that "the teaching of the sounds is to provide a special 
ordering for the sake of functionality of rules". Tradition uses the term 
pratyahara (lit. bringing together) for the groups formed in this manner using 
a marker sound as the end delimiter.” 


Panini performs the task of forming groups in a generic manner. In- 
stead of enumerating individual groups, he provides a rule-based method 
for forming and naming them. The reason for this way of forming groups 
is brevity (laghava)? The technique employed for forming sets of sounds is 
employed for formation of groups of components as well. Examples include: 


? Compare (RVPr. 1.1-14). B. Breloer (1929 p. 114-135) provides a detailed study of the or- 
ganization of phonemes on the basis of place of articulation in comparison to the different 
ordering in the Sivasütras. See also (Cardona 1965 p. 225-237). 

3 These are also called it-markers. Usually the it-markers are attached to some component 
and this attachment is regulated by a set of meta-linguistic rules. In case of Sivasütras how- 
ever, they are placed as delimiters. In order to distinguish them from the it-markers appear- 
ing in the components, we write them in italics and not in ( )-brackets. Cardona (1969 p. 12) 
employs a more appropriate formulation here than Joshi and Roodbergen (1986 p. 188), 
“placed” instead of "added", as the it-markers are not attached to the previous sounds, 
they are just placed after them. They do not, for example, make the sound uin aiu | a nit 
sound. P. S. Subrahmanyam (1999 p. 88-91) recognizes this in his notation of pratyaharas. 

4 This is specified by the rule: MRA FÈT 12.2.9? ll > an initial element together with the 
final it sound includes intervening elements. 

5 The process which is at work here is to form sub-sequences. Given a sequence, a number 
of sub-sequences can be formed, not all of which are relevant for grammar specifications. 
Given a finite alphabet =" upto 2” sub-sequences, (7) of length k can be generated, which 
need not all be different. The contiguous sub-sequences which can be generated are Bin 
in number. Panini, however, employs 41 sub-sequences. There is a 42nd pratyahara, namely 
cay, mentioned by Katyayana in his Varttika 3 under the rule TRAR FACT c.v. vc Il. See 
(Subrahmanyam 1999 p. 90). 

6 qfi guest: (PV. 19). See (Joshi and Roodbergen 1986 p. 186-187). 

7 Panini does not use this term. See (Abhyankar 1986 p. 266-167). 

8 Devasthali (1967) is a detailed study on the anubandhas of Panini. The structure of the 
Sivasütras and its explanation on the basis of brevity is worked out by Staal (1962 p. 1- 
10), Cardona (1969 p. 3-48), Kiparsky (1991a) and Petersen (2009 p. 79-98). Wiebke Petersen 
(2008) has proved using mathematical methods that the listing of units and the choice of 
positioning of markers are optimal. 
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tin = [ti(p) tas jhi si(p) thas tha mi(p) vas mas ta atàm jha thas atham dhvam 
i(t) vahi mahi(n)]. 
tari = [ta atam jha thas athàm dhvam i(t) vahi mahi(n)] 
sup = [s(u) au (j)as am au(t) ($)as (t)à bhyam bhis (n)e bhyam bhyas (n)as(i) 
bhyàm bhyas (n)as os 4m ni os su(p)] 
sut = [s(u), au, (j)as, am, au(t)] 


Here the non-marker portion of the first component is taken as the initial 
part, and the last marker sound of the final component of the sequence as the 
final part. 


Pratyaharas are names of groups and sub-sequences. This indicates that 
they belong to the category of attributes. Forming groups of sounds and 
naming them implies that each element of that group is assigned an attribute, 
namely the name of that group. Thus the sounds a, i, u are assigned the 
attribute an, since they form the group (a, i, u}. The rule tasya lopah? which 
is applied for elision of the marker sounds does not apply in case of markers 
appearing in these names. It is for this reason that in my notation the marker 
sounds originally coming from a component, but now forming the name of a 
group, are not placed within brackets but are denoted by italics. 


In formal representation, pratyaharas and other names for groups of compo- 
nents are attributes that are attached to the respective sounds or components. 
+a: ac 

1. Xm+a || Xn[a, i, u, y, | e, o, ai, au] 


SR SUL > aj i u, (n) E E UM mr | (k). CAT all we, o, (0). tat 
UU > ai, au, (c). Mees FEAT 12.2.92 ll » an initial element together with 
the final it sound includes intervening elements. 


+a: tin 
1. X«a || X[ti(p), tas, jhi, si(p), thas, tha, mi(p), vas, mas, ta, atam, jha, thas, 
atham, dhvam, i(t), vahi, mahi(n)] 
weeds AEM 2.2.91 > an initial element together with the final it 
sound includes intervening elements. Nakakatanga Ag RA- 
f&& NRVC I »- ti(p) tas jhi si(p) thas tha mi(p) vas mas ta atam jha thas atham 
dhvam i(t) vahi mahi(n) are the substitutes of the cover term | or a lakara. 


Sometimes the order of the sounds within a pratyahara is important. 


9 qeq SI: 12.3.2 > its elision (takes place). 


86 4 Representations 


Consider, for example, the rule eco’yavayavah'® where the order of sounds 
[e, o, ai, au] in ec must necessarily be maintained to correspond with the 
components [ay, av, ày, àv]. Sometimes, however, the order of sounds within 
a pratyahara is of no importance. For example, in the rule vrddhiradaic! the 
order of (ai, au] in aic does not need to be fixed. 


The attributes at, at, it, üt etc. characterize the respective length of the 
vowel. For example: 


ta: at 
1. Xm*a || Xn[a][dirgha] 


AREAS !2.2.991l mB an ac followed by the marker t stands for sounds 
having the same time duration. 


The condition clause Xm[a] [hrasva] specifies that both the IDs, namely 
the one corresponding to a and the one to hrasva, must be present in the 
sound-set Xm. The following convention is used here: 


6. Xm[—] [—] says that both the variables within the two square 
bracket must be simultaneously present within the sound-set Xm. 


4.1.1.3 Use of attributes 


Within the grammatical corpus, the names of smaller sets are used to consti- 
tute a larger one. For example, the rule tin sit sarvadhatukam specifies that the 
elements of the sets tiñ and sit together form the set called sàrvadhatuka.?? 
Now, as mentioned above, tiri is a set of 18 suffixes and Sit also is the name of 
the group of components having $ as it-marker. This can be seen as set union 
where a larger set is formed on the basis of the union of two or more smaller 
sets. 


+a: sarvadhatuka 


1. X+a || X[tiz, Sit] 


10 qsqa: 8.2.96 ll » ec = [e o ai au] are replaced by ay av ay àv respectively when ac 
follows. 


aS 


1 FRAT 2.2.2 ll > vrddhi stands for at and aic. 

12 Because of the anuvrtti of dhatoh 3.1.91, strictily speaking, these suffixes are termed sar- 
vadhatuka once they are introduced after a verbal root. But for the sake of simplicity, I am 
not taking this into consideration here. 
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fae [ENS ESIE DIEA l| à.9.2?31l > ti and those having $ as it marker are called 
sarvadhatuka. 


The above representation says that if a language-component has either 
tiri or Sit as its attribute, then the attribute sarvadhatuka should be attached 
to it.P 


4.1.1.4 The it-markers 


The it-markers attached to a component give rise to a set which is named as 
the set of components with a particular it marker. For example, $it components 
are those that have 6 as it, like ($)a(p), (c)ana($) etc. These are represented by 
including the attribute sit in the corresponding sound-sets. Thus, the new rep- 
resentation of the component (§)a(p) would be: 


X = [(a, ($)a(p), Sit, pit, ...}] 


The primary list of the components records this information about the 
it-markers by including attributes like sit or pit in the new representation of 
the component ($)a(p). 


Sometimes components not originally marked with it-markers are as- 
signed markers through specific rules. At other times, the characteristics 
of having a marker is removed in certain instances. We will address such 
dynamic assignment of it-markers later in sections 4.1.1.12 and 4.1.1.13. 


4.1.1.5 Groups based on some identifying sound 


Sometimes the set of components is specified by an indicatory sound which is 
shared by all the elements of the group. For example, in the rule dadhaghvadap 
the intended components are the ones having the form dà or dhà with the ex- 
ception of da(p) and dai(p) i.e. (du)da(fi), (du)dha(fi), da(n), de(n), do, dhe(t). 
Here the formulation of sütra is not explicit and additional explanations are 
required to interpret it. There is no precise specification of the elements of 
the group. The intended components need to be enumerated in an explicit 
manner as follows: 


+a: ghu 
1. X*a || X[(du)da(fi), (du)dha(fi), da(n), de(n), do, dhe(t)] 


13 In fact, here the condition that it must be a suffix or pratyaya which is introduced after a 
dhatu or verbal root is not mentioned. This is for the sake of simplicity and explanation. 
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qaq, 2.2.20 > (du)da(f), (du)dha(f), da(n), de(n), do and dhe(t) are 
ghu. 


Thus, the instances where an explicit enumeration is sought, but not specified 
in clear terms are determined by exhaustive listing of the components meant 
to be listed. 


4.1.1.6 Specification of the boundaries 


A usual method employed in the Sütrapatha of the Astadhyayr is to specify 
the boundary within which the components that are mentioned are assigned 
some attribute. For example, the rule pragrisvarannipatali^ states that the 
components enumerated prior to risvare in the rule adhiri$vare? are termed 
nipata. Sometimes the boundary is not mentioned explicitly, but is indicated 
by the domain or adhikara. For example, the attribute pratyaya is assigned 
on the basis of mentioning the domain in the rule pratyayah'®, which says 
that the components introduced hereafter till the end of the fifth chapter are 
called pratyaya. It should be noted that there is no separate enumeration 
in this case, but the components are specified together with the conditions 
under which they are introduced and the associated operations that need to 
be carried out. Explicit statements are formulated in the new framework to 
attach such attributes. 


*a: pratyaya 
1. X«a || X[sa(n), (k)ya(c), kamya(c), ..., K(aXt), ..., ti(p), ..., (Sa(p), ...] 
IAA: 12.8.2 ll > pratyaya are components introduced subsequently. 


4.1.1.7 Formation of groups of attributes 


Panini not only forms sets of components, he also defines the collections 
of attributes. For example, the attribute karaka is the name of the group of 
attributes {kartr, karman, karana, sampradana, apadana, adhikarana]. 

*a: karaka 


1. X+a || X[kartr, karman, karana, sampradàna, apadana, adhikarana] 


14 GEAR: 00.9.45 Il > Before the rule 97 the units introduced are assigned nipata. 
15 ARIAT 2.4.99 ll > adhi is assigned karmapravacaniya when it conveys lordship. 
16 JA: 8.2.2 ll > pratyaya are components introduced subsequently. 
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It should be noted that although the semantic definition of karaka is not con- 
sidered here, as the Astadhyayi does not provide it in an explicit manner 
through some meaning-expression, the term karaka stands for the group of 
attributes mentioned above. 


4.1.1.8 Specifying the complementary set 


Panini also specifies a set by defining it as complementary to some other sets. 
For example, in the rule arthavadadhaturapratyayah pratipadikam!" the set of 
nominal stems (pratipadika) is defined as consisting of components that are 
meaningful (arthavat) and do not belong to the sets of verbal roots (dhatu) 
and suffixes (pratyaya).'® 


+a: pratipadika 
1. X«a || XM[arthavat] ^ XNOT[dhatu] ^ XNOT[pratyaya] 


HAATTGLITA: TATRA 02.3.2809» a meaningful component which is not a 
dhatu and not a pratyaya is pratipadika. 


All the above methods are different ways to specify the formulation of 
sets or sequences of components. The attribute in these cases is the name of 
the group and is attached to each element that belongs to this group. 


Theoretically it is possible to specify them in an explicit and precise 
manner by enumerating the set. Although, in the oral framework of Panini 
there are some interesting and elegant ways to formulate formation of a 
group, it is neither precise nor explicit in all cases. In the new representation, 
all such instances are specified explicitly. 


4.1.1.9 Naming the distribution of components 


Apart from group formations, attributes also signify a specific distribution of 
the components in the derivational process. Consider the attachment of the 
attribute samyoga. It is stated as follows: 


17 HIATT GAA: TATA NA > a meaningful component which is not a dhatu and 
not a pratyaya is pratipadika. 

18 For the sake of simplicity we leave here other conditions mentioned in the rule #afad- 
SAMIA IL $.3.98 ll > components ending with krt or taddhita or those which are samasa as 
well. 
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*a: samyoga 
1. XmXn+a || Xm[hal] ^ Xn[hal] ^ IDX [n2m*1] 


TASAA: FART: 12.2.9 11 > hal in contiguous sequence are samyoga. 


Here, XmXn*a specifies that the attribute a should be attached to both 
the sound-sets, namely to Xm as well as Xn. Thus samyoga is attached to 
both the sound-sets. The condition Xm [hal] states that the sound-set Xm must 
contain the attribute hal. Similarly, Xn [ha/] says that the sound-set Xn must 
also contain hal. Moreover, IDX [n=m+1] specifies that the indices m and n are 
contiguous. The logical ^ sign specifies that the three conditions are related 
with the logical AND operator. The new conventions that are needed here are: 


7. XmXn are consecutive sound-sets (phonemes). The order is FIXED. 
8. IDX[n=m+1] says index n is equal to the index m*1 i.e. n is the next 
contiguous index following m. 


It should be noted that this type of convention is a personal choice of the 
editor to note down the eventualities. However, from the point of view 
of complexity of grammar formulation, the concept of consecutive sounds is 
important. 


The form or structure of this statement can be specified as follows: 


ba 
XnXn«*a || Xm[—] ^ Xn[—] ^ IDX[—] 


The form of this statement is different from forms of the previous statements. 
This implies that the attachment of an attribute like samyoga needs a very 
different operational function than attachment of attributes like vrddhi, 
guna etc. In this case, it is the nature of the adjacent sound-set which would 
decide whether the two adjacent sound-sets can be assigned the attribute 
samyoga or not. This aspect of positional distribution requires the use of 
IDX[—] clause. Here, however, there are no numbers as indices, but only the 
constraint that the sound-sets Xm and Xn need to be adjacent. 


Another example is the rule aco'ntyadi ti. It assigns the attribute ti to 
that part of the sound sequence which begins with the last vowel. Thus, the 
attribute ti is attached dependent upon the distribution of the language- 
component. It is specified by the statement: 


*a: ti 


1. Xbeta || Xb[ac] ^ XmNOT[ac] ^ IDX[m-(b*1 TO e)] 
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Asen È 1 2.2.8¢ ll > that part which begins with last ac is ti. 


This states that the attribute ti is attached to a sub-sequence of the language- 
component X, namely from the sound-set Xb upto the sound-set Xe. Moreover, 
Xe is also the final sound-set of that language-component. Further, the initial 
sound-set of the sub-sequence should have the attribute ac and the subse- 
quent sound-sets should not contain the attribute ac. In the representation 
above, the suffixes be denote a range from Xb upto Xe. Thus, Xbe represents 
the sub-sequence [Xb, Xb+1, ..., Xe]. Further, Xn where the index IDX ranges 
from m = b*1 TO e says that the sub-sequence [Xbt1, ..., Xe] of sound-sets 
is meant. Here, the necessity of checking a particular range of sound-sets 
indicates that the assignment of the attribute sometimes depends upon the 
distribution of various components in a derivational state. The following 
new conventions are introduced: 


9. Xbe represents the range of sound-sets from Xb till Xe where Xe is the 
final sound-set of that language-component. Xm is a variable sound- 
set which ranges from the index b+1 till e. 
10. XmNOT denotes that the sound-set Xm should NOT contain any of the 
IDs mentioned in this condition clause. 


The next attribute upadha is attached to the penultimate sound-set of a 
language-component. 


+a: upadha 
1. Xuta || Xu[al] ^ Xe[al] ^ IDX [u=e-1] 


HAHA STA N.28 > an al which is penultimate is upadha. 


The subscript e in Xe indicates that it is the final sound-set of the language- 
component. Moreover, the condition under the IDX clause provides that Xu is 
the penultimate sound-set. As a convention: 


11. Xu is reserved for the penultimate (upadha) sound-set. 


As the next example, consider the definition of laghu and guru. In the 
Astadhyayi it is specified through the following three sūtras : hrasvam laghu, 
samyoge guru, dirgham ca. Together, they say that a hrasva sound (short vowel) 
is laghu (light), except when samyoga (consonant cluster) follows, then it is 
guru (heavy), as also a dirgha sound (long vowel). 


From the perspective of the new formal framework, there are two in- 
stances of attachment of attributes, namely laghu and guru. Accordingly, 
there are two different statements. Consider first the attachment of the 
attribute laghu : 
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+a: laghu 
1. Xmta || Xn[hrasva] ^ XnNOT[samyoga] 


eet Sy lt9tell » hrasva is laghu. AÌ Te 12.8.2211 > when samyoga 
follows, it is guru. 


Both the sütras 1.4.10 and 1.4.11 mentioned above need to be taken into 
account. It has to be made explicitly clear that a hrasva sound is only then 
laghu when samyoga does not follow. This information is implicit in the 
oral formulation. XnNOT[samyoga] notes this explicitly. It says that the 
next following sound-set Xn should not contain the ID samyoga. The two 
conditions are joined by the logical AND operator, which is represented in this 
book through the ^ sign. Moreover, a convention is used here, namely that 
Xm and Xn represent two consecutive sound-sets. 


The statement for attachment of guru is as follows: 


*a: guru 
1. Xmta || Xn[hrasva] ^ Xn[samyoga] 
2. Xmta || Xn[dirgha] 


aed c 2.8201 > hrasva is laghu. Gant TS 02.9.22! » when samyoga 
follows, it is guru. d "d I 9.9.23 Il > dirgha as well. 


The conditions are now listed in two separate lines. They represent two 
disjoint cases when the attribute guru is to be attached. The first is when 
the sound is hrasva (short vowel) and is followed by samyoga or consonant 
cluster, and the second case is when the sound itself is dirgha (long vowel). 
The individual lines contain the conditions that can be combined together 
with the logical OR operator. The reason for keeping them separate is not 
just for the sake of readability, but to distinguish between disjoint sets of 
conditions. This, as will be made clear, is important for noting the complexity 
of the conditions in an explicit manner (see section 4.2.2). 


4.1.1.10 Associating attributes with meaning-expressions 


The cases mentioned thus far are all dependent in one way or another upon 
the grammatical information stated in the grammatical corpus. The next big 
category of attribute assignment is through semantic characterization. This is 
usually achieved by associating the attributes with a meaning-expression. 
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The following statement associates the expression mukhandasikavacana 
(an utterance spoken simultaneously through mouth and nose) with the term 
anundsika. In this case, the meaning of the expression mukhandsikavacana 
must be clear to the person using the grammar. In the present case, the user 
needs to be consulted in order to ascertain whether the sound represented by 
the sound-set Xn is mukhandasikavacana or not. 


+a: anunasika 
1. Xm+a || Xm[a, i, y, |, u, e, ai, o, au, y, l, v] ^ XmM[mukhanasikavacana] 


JENAS: !9.2.6ll » from mouth and nose uttered sound is 
anunasika. 


The new representation indicates the semantic character of the condi- 
tions by using the letter M in conditional clauses. Thus the M part in XnM says 
that the sound-set Xn takes the attribute anunasika, if it fulfils the semantic 
condition judged through the user that the sound is uttered simultaneously 
through the oral and the nasal cavities (mukhanasikavacana). The first con- 
dition, namely, the sound-set Xn must contain one of the above mentioned 
sounds like a, i, r etc. is not mentioned explicitly in the original sūtra, but is 
included on the basis of the explanations of it. The following convention is 
used here: 


12. XmM is a meaning-condition associated with Xm. It implies that the 
user needs to be consulted in order to ascertain the admissibility of 
the meaning-expression. 


The next statement is of different kind. Instead of attaching an attribute it 
provides an answer to a particular kind of relation between two sound-sets. 
The relation to be tested is with respect to the attribute a. 


Ta: savarna 
1. XpXq?a || XpXaM[tulyasyaprayatna] ^ Xp[ac] ^ Xq[ac] 
2. XpXq?a || XpXaM[tulyasyaprayatna] ^ Xp[hal] ^ Xq[hal] 


gemmae «qui 12.2.41 » sounds with same articulatory effort are savarna. 
ABTS 12.2.20 ll > but not ac together with hal. 


The sütra: tulyasyaprayatnam savarnam says that two sounds are savarna 
when they have the same place of articulation (tulyasyaprayatna). Further, 
najjhalau clarifies that a vowel (ac) and a consonant (hal) can not be savarna, 
even if they have same place of articulation. This implies that either both of 
them are ac or both of them are hal sounds. This is specified by the condition 
clauses Xp[ac] ^ Xg[ac] which posit that both the sound-sets must contain 
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ac. The other option that both of them should contain hal is comprehended 
in the next part of the statement. It must be noted that for the application 
of this statement, one needs to know what the places of articulation are 
and which sounds are articulated from which part of the mouth. Thus, the 
meaning-expression needs further explanations. The conventions used in 
this case are recorded as follows: 


13. XpXq are two sound-sets (not necessarily contiguous). 

14. XpXq?a represents the question whether the sound-sets Xp and Xq are 
related with respectto the attribute a. The answer would be a boolean 
TRUE or FALSE. 


4.1.1.11 Attributes based on intention 


There is another category of attributes that are defined through a meaning- 
expression where the intention (vivaksd) of the speaker plays a central role. 
For example, attachment of the attribute kartr to any specific component 
shall depend upon the intention of the speaker, whether she or he intends 
to express kartr through that component or not. Thus, although the rule 
svatantrah kartta defines kartr to be that karaka which is independent of 
others (svatantra), yet its attachment to a particular component depends upon 
the intention of the speaker. 


*a: kartr 
1. X+a || XM[svatantra] 


CATH: di 2.9.1 > that karaka which is svatantra or independent of others 
is kartr. 


Here M in the condition clause XM[—] specifies that the intention of the 
speaker through the meaning-expression should be taken into consideration. 


4.1.1.12 Extending an attribute 


Sometimes, certain attributes are extended to components under particular 
circumstances. Tradition calls these provisions atidesa (extention) rules. 


+a: nit 


1. Xita || Xh[ga(n), kutadi] ^ Xi[pratyaya] ^ XiNOT[Ait][nit] 
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mge Rfs, I 2.2.2 ll > after ga(n) and kutadi those pratyaya not having 
ñit or nit are assigned nit. 


Here the positions of Xh and Xi have to be in this order. That means a 
suffix in the language-component Xi which has neither fiit or nit and which 
comes after the component Xh, having either the component ga(n) or the 
attribute kutadi, is attached the attribute nit. 


4.1.1.13 Removing an attribute 


There are cases when an attribute is removed under certain conditions. 
Consider the following example: 


-a: kit 


1. Xi-a || Xg[$i(n), svid(a), (fii)mid(a), (fii)ksvid(a), (fii)dhrs(a)] ^ Xh[i(t)] ^ 
Xi[nistha] 


C. CCS CS LC S ON 


TAST gag: 12.3.2 > nistha preceded by i(t) and coming after 
éi(n),, svid(a), (fii)mid(a), (ñi)kşvid(ā), (fii)dhrs(a) looses kit. 


Here, instead of adding, an attribute is removed. This is denoted by 
-a: which says that the attribute following after this should be removed 
from the appropriate language-components. The condition clauses specify 
the constellation. There are three language-components, Xg, Xh and Xi in 
this order. The language-component from which the attribute kit should be 
removed is Xi, specified by the attribute nisthà, which according to the rule 
ktaktavatü nistha!? stands for the components (k)ta and (k)tavat(u). Further, 
this will happen only when it is preceded by Xh having the augment i(t). The 
whole complex should follow Xg with one of the components listed in it. 


4.1.2 Combining the components 


The process of derivation involves the combination of the constituent compo- 
nents with one another. This leads to a change at the form level. A change at 
the form level can happen when a component is introduced. Further, changes 
can also be induced by replacement of a component, augmentation or when 
some of the phonetic features are changed, for example, when a short vowel 
becomes long. 


19 gei ABT 12.2.38 Il» (k)ta and (k)tavat(u) are nistha. 
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4.1.2.1 Introducing a component 


Within the formal framework, the combination of components is modeled 
through a fundamental operation, namely introduction of a new language- 
component. Consider the introduction of the suffix l(a)(t). 


<y: a) 
1. X«y || X[dhatu] ^ yM[vartamana] 


EIE BE 13.3.1 381I > to express present time, introduce l(a)(t). 


The basic structure of the above representation is similar to the attach- 
ment of the attributes. In the first line, however, the sign « represents the 
operation of introducing a component whose ID is mentioned after the 
colon. The second line contains the conditions. Its first part specifies the 
position where the new component should be placed. In this case, after a 
language-component X which has the attribute dhatu. Moreover, the second 
condition specifies that the component l(a)(t) is introduced when vartamana 
(present tense) is intended to be expressed. The user needs to be consulted 
for this decision. Hence it is indicated by the variable M in yM. 


4.1.2.2 Replacement of a component 


Combination of components also involves replacement of a component 
by another component. Consider the replacement of the lakara, say l(a)(t), 
by the suffix ti(p). Within the new framework, it can be represented as follows: 


«Sy: ti(p) 


1. Xi««y || Xh[dhatu][parasmaipada] ^ Xi[lakara] ^ yM[prathama]- 
[ekavacana] 


SA I3.9.99ll > in place of suffixes with cover term | namely l(a)(t), 1(i)(t), 
Iut) Y), e(t), KoH, Kai), Y)(n), uñ) and Yr)(n) the suffixes 
coming in the next rule are substituted. femeamrceesifireenearatarmariteatig- 
ARS N2¥.9¢ 1 »- ti(p) tas jhi si(p) thas tha mi(p) vas mas ta atam jha thas 
atham dhvam i(t) vahi mahi(n) are the substitutes of the cover term 1 or a 
lakara. 


The basic structure of this representation is again similar to the previous one. 
The operation, however, is replacement and the symbol <<y indicates this. 
The first line specifies the replacement operator followed by the replacement 
variable y, which after the colon is followed by the ID of the component. The 
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second line lists the conditions. The locus of the component which is to be 
replaced is specified in the first part before the vertical lines. In this case the 
component which is to be replaced is Xi, and the replacement is noted by the 
variable y. Finally, the conditions are stated. The two components Xh and Xi 
should be in this order. This is recorded in the meta-linguistic conventions of 
the new formulation. 


15. Xh and Xi indicate that the two components have to be in this order. 


Although a different operation, replacement is not a fundamentally new oper- 
ation, and in the formal representation it can be considered as a combination of 
introduction of a component and attachment of an attribute. Those sound-sets 
that get replaced are attached the attribute 6. The new component is placed 
after the replaced component. In the example above it would imply that the 
replaced language-component Xi gets the attribute ô and the new component 
ti(p) is placed after it. 


4.1.2.3 Augmentation 


Another operation is augmentation (agama) by phoneme-sequences that do 
not carry any specific meaning for the sake of attaining the desired surface 
form. For example, the rule zrdhadhatukasyedvaladeh provides the initial 
increment i(t) which is inserted at the head of an ardhadhatuka suffix that 
begins with a val phoneme.” 


<y: i(t) 
1. y>Xi || Xh[dhatu][anga] ^ Xi[ardhadhatuka] ^ Xib[val] 


e 


SIMI: ie 3.341l » the increment i(t) is inserted at the head of an 
ardhadhatuka affix which begins with a val phoneme. 


This is modeled in the same manner as the introduction of a new com- 
ponent. 


16. y»Xi indicates that the attachment here is an augmentation. 
17. In Xib the suffix b represents the initial sound-set. 


? The val phonemes are {y, r, 1, ñ, m, ñ, n, n, jh, bh, gh, dh, dh, j, b, g, d, d, kh, ph, ch, th, th, 
G t t k, p, $ $, s, h} 
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4.1.24 Change of phonetic form 


The form of a language-component may also change if the phonetic attributes 
are modified. For example, the short (hrasva) vowel may be changed to a 
long (dirgha) vowel. Similarly, the form level is also modified by the non- 
appearance of a sound. A component, or parts thereof, may undergo elision 
(lopa) i.e. non-appearance at the form level. For example, the rule lopo vyor- 
vali?! says that the sounds v or y are elided if a suffix beginning with a val 
sound follows. 


4.2 Structure of a statement 


Statements follow a fixed syntax. The structure of a general statement can be 
specified as follows: 


Operation-Declaration: ID 

1. Placement-Distribution || ANDed Conditions 
2. Placement-Distribution || ANDed Conditions 
3. Placement-Distribution || ANDed Conditions 


There are two main divisions. The first line, depicted in bold-case letters, 
declares the unique operation and the other lines appear as a numbered list 
in the printed version, and specify an independent set of conditions that need 
to be met in order to carry out the above operation. 


Within the operational declaration there are two parts separated by a 
colon. The first part specifies the nature or type of the operation. For example, 
attachment of an attribute is noted by *a. The other part specifies the ID of 
the component, either the ID of the attribute which is to be attached, or of the 
constituent component which is to be added. 


After the header line, there may follow one or more lines, appearing as 
a numbered list in the printed version, that provide conditions for the above 
operation. Each of the numbered condition lines constitutes a particular 
case, when the operation specified in the header line can be carried out. The 
conditions specified in different lines are mutually disjoint. 


A condition line itself consist of two parts separated by || two vertical 
lines. The first part consists of the structure or distribution of the derivational 
state and the location where the attachments or additions are to be made. The 
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21 STG AAS 1.2.88 > lopa replaces v or y if a suffix beginning with val follows. 
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second part records the conditions in terms of IDs that need to be present. 
The conditions within a line are ANDed. The different lines combined together 
with the logical OR cover all the cases when a particular operation is to be 
performed. 


The locus where a component should be added or an attribute should 
be attached is specified clearly using the exact variables for the correspond- 
ing sound-sets or language-components. The meta-rules in the Astadhyayt 
regarding the locus of grammatical changes are taken care of by explicitly 
specifying the relative positions of the variables and use of sub-scripts. 


The formal framework facilitates new methods and parameters to look 
into the complexity of grammar and its application for the derivational pro- 
cess. This can be formulated in terms of the types of operations, conditional 
complexity of a statement as well as inter-dependence and inter-relation 
between the statements. 


4.2.1 Types of operations 


The processes specified in the Astadhyayi can be summarised in terms of the 
new framework in the following types: 


*a: Attachment of an attribute. An attribute can be attached to one or more 
sound-sets, or to an entire language-component. 


-a: Removing an attribute. There are instances when some attribute is taken 
out or removed from a sound-set or a language-component. For example 
in certain cases the attribute kit is removed (see the rules 1.2.7-1.2.26). 


?a: Checking the consistency. As a consistency check, statements are mod- 
eled as a question whose answer can either be TRUE or FALSE. 


Xy: Addition of a component. 


These are the four fundamental types of operations. Other operations like re- 
placement or elision can be implemented as a combination of the above fun- 
damental operations. Thus, 


<<y: Replacement or substitution can be implemented as a combination of 
(i) attachment of a new attribute 6 to that part which is replaced and (ii) 
addition of the component which is the replacement at the appropriate 
place. Elision or lopa would involve attaching the attribute lopa to the 
concerned parts in the language-component. The sound-sets containing 
the attribute lopa would then not be expressed at the form level which is 
the audible layer. 
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4.2.2 Complexity of a statement 


Consider again the representation of attachment of the attribute vrddhi in 
the new framework. 


*a: vrddhi 
1. Xm+a || Xn[at, aic] 
gR, 12.2.2 ll > vrddhi stands for at and aic. 
Leaving aside the grammatical elements, the structural form of this statement 
would be as follows: 
+a: — 


Xm+a || Xm [—] 


We can term this as the signature of the above statement. The signature of a 
statement comprehends primarily the structural complexity of the conditions 
that need to be satisfied in order to apply that statement. 


Looking at the following representation of attachment of the attribute 
guna, it is apparent that it has the same signature as the previous statement 
for attachment of the attribute vrddhi. 
*a: guna 

1. Xmta || Xm[at, eri] 
weg: l| $.2.31l > at and eri are guna. 


Let us now consider attachment of the attribute laghu. 
+a: laghu 
1. Xmta || Xn[hrasva] ^ XnNOT[samyoga] 


Set Sy lt9ell » hrasva is laghu. AÌ Ww 12.89.221 > when samyoga 
follows, it is guru. 


The corresponding signature would be as follows. 


*a:— 
Xm*a || Xn[—] ^ XnNOT [—] 


Similarly, for attachment of the attribute guru, we have the following state- 
ment. 
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*a: guru 

1. Xmta || Xn[hrasva] ^ Xn[samyoga] 

2. Xmta || Xm[dirgha] 
aed cx !9.9.29!| > hrasva is laghu. Gar Te 2.4.22 » when samyoga 
follows, it is guru. di F I 9.9.23 Il > dirgha as well. 


The signature for attachment of guru is as follows. 


+a:— 
Xmta || Xm[—] A Xn[—] 
Xm*a || Xm[—] 


If one now compares the statements for attachment of vrddhi or guna with 
that of laghu or guru, one notices that the operation is the same in all cases, 
namely attachment of an attribute to some sound-set. What is different is the 
nature and complexity of the conditions. There are three different types of 
structures for the conditions: 


Xm*a || Xm [—] 
Xmta || Xn[—] ^ XnNOT [—] 
Xmta || Xn[—] A Xn[—] 


Attachment of many of the attributes discussed before have signatures 
similar to the one mentioned above. In contrast, attachment of attributes like 
samyoga, upadha or ti that depend on the distribution of adjacent sounds 
have very different signatures. 
+a: samyoga 

1. XmXn+a || Xm[hal] ^ Xn[hal] ^ IDX [n2m*1] 


ESSAAT: FART: 12.2.9 » hal in contiguous sequence are samyoga. 


The corresponding signature would be: 


tarac 
XmXn+a || Xm[—] ^ Xn[—] A IDX[—] 


The conditions for grammatical operations can be summarized as a combina- 
tion of the following basic types: 


The intention (vivaksa) of the speaker plays a central role in the introduction of 
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many components. It is formulated through meaning-expressions and is usu- 
ally stated in the locative case. For example, consider the introduction of the 
suffix l(a)(t). This is introduced if the speaker intends to express present tense. 


Xy: l(a)(t 
1. X«y || X[dhatu] ^ yM[vartamana] 


EIE BE 113.3.?381l > to express present time, introduce l(a)(t). 
The signature of this statement is as follows: 


Ge 
X<y || XL—] A yM[—] 


The second part of the condition clause yM[—] denotes that the intention of 
the speaker communicated through the meaning-expression is required for 
the application of the statement. 


The presence (or sometimes absence) of attributes often provides condi- 
tions for the grammatical operations. For example, in the above statement 
for introducing the suffix l(a)(t), the first part of the condition clause XL—1 
requires the presence of the attribute dhatu (verbal root) in the component 
to which the suffix can be added. The presence of specific components quite 
frequently conditions the grammatical operations. 


<y: sa(n) 
1. X<y || X[gup(a), tij(a), kit(a)] 
JARRA: HA 3.2.4 1l > after gup(a), tij(a), kit(a) introduce sa(n). 
Here, the condition clause X[—] is similar in structure to the previous 
example. The only difference lies in the type of elements that populate this 


condition. In the previous case it was the attribute, and in the present example 
these are the specific components, namely, gup(a), tij(a) and kit(a). 


A specific constellation or distribution of the components also provides 
a condition for certain operations. Take the example of attachment of the 
attribute ti. 
ta: ti 

1. Xbeta || Xb[ac] ^ XmNOT[ac] ^ IDX[m=(b+1 TO e)] 
Asen È 0 ¢.2.8¢ ll > that part which begins with last ac is ti. 
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The condition has to take into account a specific distribution of sound- 
sets, namely that part which begins with the last vowel and extends to the 
end of the language-component. Similarly, the rule yasmatpratyayavidhistadadi 
pratyaye’ngam specifies anga to be that portion after which a suffix is pre- 
scribed and when the suffix follows. Such instances of attribute attachment, 
therefore, require looking at the current derivational state and, based upon 
how the different components are placed there, deciding whether a particular 
attribute should be attached or not. These are dynamic attributes. 


Specific grammatical processes may also form a condition. In certain 
cases not only a particular derivational state needs to be looked at, but the 
progress or change from one state to the next is important. 


For example, the rule igyanah samprasüramam?? terms samprasarana to 
the ik sounds, i.e. [i, u, r, |] which come in place of yan i.e. [y, v, r, 1]. In order 
to represent conditions of this kind, we need to look into the history of a 
particular component. This is facilitated by looking into the derivational 
states and slices that store the derivational process. 


Mutual relations of certain grammatical processes or changes also set 
conditions. For example, the rules listed in the last three sections of the eighth 
chapter of the Astadhyayi. By the rule pürvatrasiddham^^ the rules listed later 
are in effect suspended with respect to previous rules. Consider the following 
situation in which a derivational state st; advances to the next state st, after 
application of a rule A. 


A (sti) — ste 


If there is another rule B which is related to the rule A in such a manner that 
A is suspended (asiddha) with respect to B, it implies that B can not operate 
on the state st» even if the conditions for the application are fulfilled. Since 
A is suspended with respect to B, it means that for rule B it is as if A was not 
applied at all. So only the state st; is visible to B. The implementation of the 
statements from this section would again require access to the derivational 
history by accessing the earlier derivational states. 


22 qeqrercaafafedara AASA 112.9. ?3 > that part which enjoins a pratyaya based opera- 
tion, before that the sequence is anga. 

25 TOU: CAAT ll 2.2.2 ll > ik replacing yan is samprasarana. 

24 ARTEA 1.3.2 ll > that which follows from here onwards is treated as if suspended in 
view of what precedes. 
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4.3 Relation between statements 


The signature of a statement specifies its structural form and comprehends the 
form of the conditional clauses. The manner in which any two components of 
grammar are related can now be formulated in terms of the categories of the 
new framework. Consider again the statement for attachment of the attribute 
vrddhi : 


*a: vrddhi 
1. Xm+a || Xm[&t, aic] 


afar, 2.2.2. > vrddhi stands for at and aic. 


The condition clause requires the presence of the attribute af or aic. This 
implies a hierarchy between vrddhi on the one hand and àt or aic on the other 
hand. This would also imply that the statement that attaches the attribute 
vrddhi will only then be relevant once the statements for attachment of at or 
aic have been applied. The statements that attach the attributes at and aic to 
some sound-set are as follows: 


ta: at 
1. Xm+a || Xm[a][dirgha] 


AREAS !2.2.99!l » an ac followed by the marker t stands for sounds 
having the same time duration. 


+a: aic 
1. Xm*a || Xn[ai, au] 


Vat vll » ai, au, (c). SAA GSAT 12.2.9? ll » an initial element together 
with the final it sound includes intervening elements. 


One can say that the statement for attachment of the attribute vrddhi 
presupposes the application of the statements for the attachment of the 
attributes āt or aic. 


Given two statements: St-A and St-B, St-B is dependent on St-A if the 
application of St-B requires the results of the application of St-A. 


The dependency graph for the components and attributes of Astadhyayi would 
provide an order in which the statements will have to be applied. The problem 
of inter-dependence or circularity, i.e. St-B is dependent on St-A and St-A is 
dependent on St-B, can be resolved by introducing a new statement, let us 
say, St-B1 with a different ID. 
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The examples from the operational rules of Astadhyayr which I have repre- 
sented in the new framework show that a formal representation of Astad- 
hyayi is possible. It is also possible to reduce the role of an individual for 
applicational purposes, although grammar functions integrally together with 
the speaker or user. The new representation also aims to clearly separate the 
form or structural complexity of the grammar from the inter-relatedness of its 
components. It provides a mechanism to look into these relations at the more 
granular level of individual components, in contrast to attempts to organize 
them within the categories of semantics, syntax, morphology and phonology. 
The present framework allows a flexible integration of information which is 
either not explicitly mentioned in the grammatical corpus or which can be in- 
cluded from other sources, e.g. the inclusion of the suggestions of Katyayana 
in his Varttika or the Unadi-sütras. 


Chapter 5 


Implementation 


A computer implementation of the content and processes of Astadhyayi can 
be aptly compared with the automatic machines for buying railway tickets. 
When interacting with such a machine, a traveller needs to supply specific 
information about the destination, route, date and time, number of travellers, 
class etc. Depending upon the interactive design of the program this informa- 
tion is required by the machine as and when it needs to take some decision 
for which the wish and consent of the traveller is necessary. On its part, the 
machine can put constraints upon the choices depending upon the current sit- 
uation, e.g. whether an option to take a particular route or train on a particular 
day is possible or not. Finally, the desired ticket is printed. 


In a similar manner, a computer program that implements the derivation of 
a linguistic expression requires input by the user regarding her or his inten- 
tion (vivaksa@). This information is necessary to make appropriate decisions so 
that finally the desired expression is formed. The program, on its part, puts 
constraints upon the possible choices that are admissible. 


In the following, I provide an overview of the computer implementation of 
the Paninian system. As discussed in the previous chapters, my rendering of 
the grammar is through its formal representation. Therefore, the basis of the 
present computerisation is the formalisation discussed in chapters 3 and 4. 


The programming codes specified here aim to show that my model of the 
Astadhyayi facilitates its computer implementation. Because of the con- 
straints of the space, there is no attempt to furnish the details which run into 
several thousand lines of program codes. Moreover, thus far there is no user 
interface for the system. This is because it does not directly contribute to the re- 
search questions and would simultaneously require considerable time which 
is beyond the scope of the present work. The programs are written in the 
Python Programming Language which runs on Windows, Linux/Unix, Mac 
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OS X and is available under open source license.! Needless to mention, there 
can be different ways of implementation depending upon the overall design 
of the program, the nature of the programming environment as well as the 
choices of the implementational platform. 


There are two main aspects of the computer implementation which I will deal 
with in this chapter. Firstly, how to implement the grammatical content on 
a computer, and secondly, how to simulate the processes of derivation. The 
former has to deal with the data-structures of the program and the latter with 
its application and dynamics. 


The basic categories of the formal framework introduced in chapters 2 and 3 
aim to represent the grammar in such a way that its application can be effected 
in an algorithmic manner. The data-structures specified below are developed 
to render the categories of this formal framework. 


5.1 Data-structures 


5.1.1 Elements 


A collection of elements—components, attributes and meaning-expressions— 
constitutes the basic database. A typical entry of this database looks as follows: 
paTha_a; 

ait_9 udAttet_9; 

p-0; 

a 0 hrasva O0; 

Th 1; 

a 0 hrasva O it O udAtta O 


The above string is separated by semi-colons (;). Here, the first entry (line 1) 
is the ID of the element. It is the ID of the verbal root paTha_a. After this (line 
2) the IDs of its attributes are noted with a blank space in between. There 
are two IDs mentioned here: ait 9 which stands for the attribute that has 
the phoneme a 0 as it 0 marker. Secondly, the attribute udAttet 9 says that 
the marker sound here is high-pitched. This is followed by entries for each 
phoneme and their attributes. There are four phonemes here (lines 3-6). The 
final phoneme a 0 carries the attribute it 0 and udAtta O0, that makes the 
whole component ait, 9 and udAttet 9. 


The collection of elements is implemented by the Elements class. 


1 The Python Programming Language—Official website https: //www.python.org (ac- 
cessed on 22.03.2016) supplies free download of the Python interpreter as well as compre- 
hensive documentation. 
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class Elements: 
def __init__(self): 
self.elements = dictionary_of_elements (ELEMENTS_FILE_NAME) 


The class variable is a dictionary of elements initialized by the file 
ELEMENTS_FILE_NAME. The class of elements can be instantiated as follows. 


>>> from elements import Elements 
>>> element = Elements () 


The first command imports the class of Elements and the second one instan- 
tiates it. The variable element is now an object variable belonging to the El- 
ements class. The class functions can now be executed. For example, the fol- 
lowing function returns the boolean value True or False, depending upon 
whether an element is present in the database or not. 


>>> element.is an element('paTha a') 
True 


Another function of this class returns a component as a list of sets, which can 
be represented as language-components. 


>>> element.get component in langComp form('paTha a') 
[set (['paTha a','udAttet 9','p O','ait 9']), 
set(['hrasva O','paTha a','udAttet 9','a O','ait 9']), 
set(['paTha a','udAttet 9','Th 1','ait 9'])] 


A list consisting of three sets is returned in the above example. These three sets 
correspond to the three sounds of the said component. It should be noted that 
certain attributes, like ait 9 i.e. having a as it-marker or udAttet 9 having 
an udatta-marker are included in the database itself. 


The value returned is suitable for representation in the new framework. Thus, 
the final marker sound is not included as part of the form of the component, 
but as an attribute. Thus, it has only three sets corresponding to the three 
sounds /p a th/, and not four sets. This is because the final phoneme in the 
original corpus is only a marker sound. 


5.1.2 Sound-sets 


Each of the three sets representing the three phonemes of the compo- 
nent paTha a are stored as sound-sets which are implemented by the class 
SoundSets. 


class SoundSets: 
def | init  (self,a set of item ids): 
self.soundSet = a set of item ids 
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It is instantiated by a set of IDs. For example, the above list of three sets can 
be represented through three SoundSets. 


>>> from soundSets import SoundSets 

>>> soundSet_1 = SoundSets(set(['paTha_a','udAttet_9', 
'p.0','adt 9'])) 

>>> soundSet 2 = SoundSets(set(['paTha a','udAttet 9', 
"hrasya 0'.!2 0T, tait 9^])) 

>>> soundSet 3 = SoundSets(set(['paTha a','udAttet 9', 
"Th 4*5, adt 9*'])) 


A sound-set contains a collection of IDs. There is, however, an important con- 
straint: it must have exactly one element from the following set of fundamental 
sounds. 


FUNDAMENTAL SOUNDS-['a 0O','i 0','u O', 


Cm Lo E E 

a i. ie p. 

'ad 0*, eu", 

‘0 hy LA NUM 
QUIM 

es M RO ELA E ae 
ejh 9 *, "bh. 9v, 


"gh 0', Dh 1*',*dh O*, 


eT ^, 5 :0*, e 0^.tp 1*, qd 0', 
Uh 0*,"ph.0*, "ch. 0", Th.1','th. 0','c.0',"'T 1','t. 0*, 


da nr. pot, 

ta Qu tg $*" vs i. 
"NOU, 

SH p MEE, 

'x 0'] 


In the above list, IDs in the first fourteen lines correspond to the fourteen Siva- 
sütras. In the 15th line, H 1 and M 1 correspond to the aspirated sound visar- 
janiya and the nasal sound anusvara respectively. Finally, x 0 in the 16th line 
is for a pause or virama. 


Because of the constraint that exactly one ID from the above set must be 
present within a sound-set, the following set of IDs are invalid candidates 
for a sound-set. 


>>> from soundSets import SoundSets 
>>> soundSet_4 = SoundSets(set(['paTha_a'])) 
Invalid sound-set! 


The soundSet_4 does not contain any ID from the above set of 
FUNDAMENTAL_SOUNDS and hence is an invalid set for a sound-set. 


>>> soundSet 5 = SoundSets(set(['a O','i O'])) 
Invalid sound-set! 


Here, soundSet_5 has more than one ID from the set of FUNDAMENTAL SOUNDS. 
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The following function of the class returns the phonetic form of a sound-set. 
The phonetic form may contain additional attributes like the length or into- 
nation of vowels. 


>>> soundSet_1.get_phoneme_as_a_set() 
set(['p_0']) 

>>> soundSet_2.get_phoneme_as_a_set() 
set(['hrasva 0O','a 0']) 


A new attribute can be added to a sound-set. The following function imple- 
ments it. 


>>> soundSet 2.addAttributes(set(['at 1'])) 
>>> soundSet_2.get_soundSet () 
set(['paTha a','udAttet 9','a O','hrasva O','at 1','ait 9']) 


Here, the attribute at. 1 is added to a sound-set. While adding the attributes, 
the consistency condition of the sound-set is taken care of. Thus, an attribute 
like dIrgha, 0 can not coexist with hrasva, 0 within the same sound-set. 


>>> soundSet 2.addAttributes(set(['dIrgha O'])) 
dIrgha O0 cannot coexist with hrasva O !! 


Similarly, existing attributes can be removed from a sound-set. 


>>> soundSet 2.remAttributes(set(['at 1'])) 
>>> soundSet_2.get_soundSet () 
set(['paTha a','udAttet 9','a O','hrasva O','ait 9']) 


5.1.3 Language-components 


Language-components constitute an intermediate unit between the whole 
sentences and the individual sounds. They may roughly (but not necessarily) 
correspond to an inflected word within a sentence. From the point of view of 
data-structures, they are a list of sound-sets. The LangComps class implements 
the language-components. 


class LangComps: 
def | init. (self,list of sets-[]): 
self.langComp-[SoundSets(ss) for ss in list of sets] 


In order to initialize a language-component, the component should be ren- 
dered in a special form, namely as a sequence of sets of IDs. This is achieved 
by a function from the Elements class. 


>>> from elements import Elements 
>>> element = Elements() 
>>> paTha-element.get component in langComp form('paTha a') 
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>>> print paTha 

[set (['paTha a','udAttet 9','p O','ait 9']), 
set(['hrasva O','paTha a','udAttet 9','a O','ait 9']), 
set(['paTha a','udAttet 9','Th 1','ait 9'])] 


An object of the class LangComps can now be instantiated by using the above 
rendering of the component paTha a. 

>>> from langComps import LangComps 

>>> langComp = LangComps (paTha) 

>>> print langComp 

p : p.O,ait 9,paTha a,udAttet 9 * 

a : hrasva O,a O,ait 9,paTha a,udAttet 9 * 

Th : Th 1,ait 9,paTha a,udAttet 9 


The output shows that the language-component has three sound-sets repre- 
senting the sounds /p, a, Th/ respectively. The sound-sets contain several 
other IDs that characterise them as well as the language-component. 


A number of functions are required to execute operations on language- 
components. For example, in order to check the conditions, it is important to 
identify the range of indices in which some attribute occurs. The question as 
to which sound-set in a particular language-component has any of the given 
attributes is implemented by the following function. 


>>> langComp.range withAny(['hrasva 0']) 
[1] 


It says that the attribute hrasva O0 is in the second sound-set of the current 
language-component.? In case more than one IDs are searched, then this func- 
tion returns all indices where any of the IDs occur. 


>>> langComp.range_withAny(['a_0','p_0']) 
[0, 1] 


In the example above, indices of those sound-sets that contain any of the IDs 
in the list ['a_O', 'p_O'] are returned. 


If one wants to attach the attribute dhAtu O to those parts of language- 
component which contains the ID paTha a then the range of indices with the 
ID paTha a needs to be identified first, followed by the addition of the at- 
tribute dhAtu O0 to the respective indices. 


>>> langComp.range withAny(['paTha a']) 

[0, 1, 2] 

>>> langComp.addAttributes(set(['dhAtu 0']1),[0,1,2]) 
>>> print langComp 

p : p.O,ait 9,dhAtu O,paTha a,udAttet 9 * 

a : hrasva O,a O,ait 9,dhAtu O,paTha a,udAttet 9 * 
Th : Th 1,ait 9,dhAtu O,paTha a,udAttet 9 


? The lists in Python programming language are indexed beginning with 0. So the first ele- 
ment of a list List is list [0] and the second one is list [1] etc. 
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The above example shows that the attribute dhAtu O0 is added to the indices 
corresponding to the sound-sets that contain paTha a. 


5.1.4 Sentences 


Sentences consist of one or more language-components. They represent the 
whole unit of a typical linguistic expression with the possibility of a number 
of inflected words. This class is necessary since the rules of grammar consider 
the whole sentence and not just one word to be the unit of derivation. 


From the point of view of a formal representation of grammatical processes, 
sentences can simply be defined as a sequence of language-components. Ac- 
cordingly, the class of Sentences is implemented as a list of LangComps. 
class Sentences: 


def | init | (self,list of LangComps-[LangComps O]): 
self.sentence - list of LangComps 


The present formal framework uses three levels to represent any linguistic 
expression. Sentences correspond to the whole unit of a particular linguistic 
expression, while sound-sets correspond to the individual sounds. Language- 
components are an intermediate level between the two and are tentatively re- 
lated to an inflected word. Depending upon the level from which conditional 
information can be gathered in a sufficient manner, the grammatical opera- 
tions can be distinguished as those that apply to a sound-set or to a language- 
component or at the level of the whole sentence. 


5.1.5 Derivational states 


The process of derivation is carried out when an operational statement is ap- 
plied to a sentence or to its constituents, i.e. the language-components or the 
sound-sets. Together with an operational statement, a sentence forms the next 
data-structure of the system, namely the class DStates. 


class DStates: # Derivational state 


def | init. (self,(sentence,statement string)-(None,None)): 
self.dState = sentence 
self.applied statement str - statement string 


The application of a particular statement brings about some change in the 
current state of a sentence. This change may be at the level of a sound-set if, 
for example, it gets a new attribute, or at the level of a language-component, 
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for example, addition of a new sound-set, or even at the level of a sentence 
itself, in the case of addition of new language-components. The dState vari- 
able of the class saves the changed state of the sentence after the application 
of some statement. The operational statement which is applied is stored in the 
variable applied statement str (see section 5.1.8 for the nature of an oper- 
ational string). 


5.1.6 Slices 


The current state of a sentence saved in a derivational state is the result of 
application of an operational statement on the previous state of that sentence. 
The sequence of such derivational states is stored in a slice and is implemented 
by the class Slices. From the point of view of data-structures, a slice is simply 
a sequence or list of derivational states or DStates. 

class Slices: 


def | init, (self,list of DStates-[]): 
self.slice = list of DStates 


There are two kinds of changes that the operational rules of grammar bring 
about: either the derivational state is saturated or it progresses towards com- 
pletion. The process of saturation is associated with attachment of attributes, 
while addition of new components is related with incremental steps of com- 
pletion of the derivational process. Slices contain only those changes where 
the derivational process is saturated, i.e. only when attributes are added to 
the components. The other case, when a new component is introduced, leads 
to the formation of a new slice. 


5.1.7 Process-strips 


The incremental steps of completion of the process of derivation results in a 
sequence of slices. Process-strips record this sequences of slices. This is imple- 
mented through the class PStrips. 
class PStrips: # process-strips 
def | init, (self,list of Slices-[]): 
self.pStrip-list of Slices 


The data-structures introduced thus far adequately represent the constituents 
and processes of the grammar in an integral manner. The processes of gram- 
mar are enacted through the operational rules. The next data-structure com- 
prehends them. 
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5.1.8 Statements 


In the previous chapter, the concept of statements in comparison with the 
sütras was introduced and specified. Statements are operational rules that are 
formulated in a formal framework and can be implemented through algorith- 
mic functions. 


The information related to a statement is stored in a particular format within 
the database. In order to point out the structure of the database of statements, 
consider the entry corresponding to the attachment of the attribute vRddhi_0. 
In the original corpus it is the very first sūtra. 

Xm_ATT_a * 

ATT_a & vRddhi O * 

Xn:[At 2,aic O] AND Xm NOT:[vRddhi 0] * 

ST TYPE:[STABILIZING] * 

A RULES: [a 11001] 


The above entry consists of five parts that are separated by a * sign. Each part 
is listed here in a separate line. 


1. The first part denotes the type of the statement. In the present case, it is 
given by Xm ATT a. It implies that this statement is about ATTachment of 
an attribute a to some sound-set Xm. 


2. The second part specifies the operation. In the present case it is given by 
ATT a & vRddhi O.Itimplies that vRddhi_0 is the ATTachment here. 


3. The third part notes the conditions that should be fulfilled in order to ex- 
ecute the operation. There are two parts and both of them need to be ful- 
filled. Hence they are conjoined by the logical AND. 


a. The first part of the condition is given by Xm: [At, 2,aic 0] that can be 
interpreted as the presence of either the attribute At. 2 or the attribute 
aic Oin the sound-set Xm. 


b. The second part Xm, NOT: [vRddhi 0] ensures that the attribute is at- 
tached only if it is not already included in the said sound-set. This is 
important to avoid recursive attachment of an attribute. 


4. The inter-relations between other statements within the database are 
noted in the fourth part. It also records the nature of the statements. Here, 
for example, it says that this statement is a STABILIZING statement which 
contributes to the saturation of the slice. 


5. Finally, the fifth part records the links to the external associations, es- 
pecially the correspondence with the original corpus of the Astadhyayi. 
Here, the sütra number a. 11001 is noted. 
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An individual operational statement is implemented by the Statements class 
and is instantiated by the corresponding entry string within the database. 


class Statements: 
def | init  (self,st strz''): 
self.statement string-st str 
self.statement = parse a statement str(st str) 


The function parse a statement str(st str) parses and returns the out- 
put as a five-tuple for the instantiation variable. 


>>> statement string = 'Xm ATT a * 
ATT à &£ vRddhi O * 
Xm:[At 2,aic O0] AND Xm NOT:[vRddhi O] * 
ST TYPE:[STABILIZING] * 
A RULES:[a 11001]' 
>>> from statements import Statements 
>>> statement = Statements(statement string) 


This class implements several functions that are important for the application 
of the statements. One such function is to get the signature of a given statement. 


>>> statement.get signature() 
'Xm ATT a. ATT a, Xm Xm NOT' 


The signature of a statement specifies its structure. Associated with a state- 
ment, there is a function which executes its application. The nature of this 
applicational function is defined by the signature of the statement. All state- 
ments with the same signature can be applied by using the same function. 


Another function supplies the information about the conditions that need to 
be fulfilled in order to apply that statement. 
>>> statement.get condition type vals dict () 
('Xn': [['4At 2','aic 0']], 
'Xm NOT': [['vRddhi 0'11) 


Here, the return value is a dictionary. Its keys are the types or nature of the 
conditions together with the corresponding values for the particular case. 
Thus, the first condition type is Xn implying that the sound-set must contain 
any of the IDs At. 2 or aic O0. The type of the second condition is Xm, NOT and 
it says that the ID vRddhi | 0 should not be in that sound-set. 


5.1.9 Statement groups 


The collection of Statements is implemented through the StatementGroups 
class. 
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class StatementGroups: 
def __init__(self): 
self.statementGroup = list_of_Statements 


The functions of this class are primarily meant for organizing and referencing 
the statements. 


>>> from statements import StatementGroups 

>>> statementGroup = StatementGroups () 

>>> statementGroup.get statements for some operation( 
'ATT à & guru O') 
[<statements.Statements instance at 0x10a24c3f8>, 
<statements.Statements instance at 0x10a24c488>] 


The list returned by the above function consists of two Statements instances 
corresponding to the two statements that provide for application of the at- 
tribute guru_0. 


>>> sti,st2-statementGroup.get statements for some operation( 
'ATT à & guru 0O') 

>>> sti.statement string 
'Xm ATT à * ATT à & guru O * Xm:[hrasva O0] AND Xn:[saMyoga_0]* 
ST TYPE:[STABILIZING] + A RULES: [a 14010] [a, 14011] ' 

>>> st2.statement string 
'Xm ATT à * ATT a & guru O * Xm:[dIrgha O0] * 
ST TYPE:[STABILIZING] * A RULES:[a 14012]' 


Similarly, the following function returns a dictionary with signature of state- 

ments as keys and the corresponding statements as their values. 

>>> d-statementGroup.get signature statements dict() 

>>> d 

f'Xm ATT a. ATT a. Xm Xm NOT': [ 
<statements.Statements instance at 0x10a382638>, 
<statements.Statements instance at 0x10a382950>, 
<statements.Statements instance at 0x10a382998>, 
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5.2 Processes of grammar 


During the process of their derivation, linguistic expressions are represented 
through a sentence which consists of one or more language-components 
corresponding to the individual inflected words. Each language-component 
contains a sequence of sound-sets. Each sound-set corresponds to a single 
phoneme. 


The derivational process is effected through a number of operational state- 
ments which are applied to a sentence. A sentence, together with an opera- 
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tional statement, constitutes a derivational state. There are two fundamental 
types of operations: (i) to saturate a sentence, in that all attributes that can 
be attached are added to it, and (ii) to add a new component and graduate 
towards completion of the derivational process. Accordingly, a slice contains 
a sequence of derivational states that arise during the process of saturation. 
A new slice is added, once a new component is introduced. A process-strip 
records a sequence of slices and thus registers the process of completion. 


Given the above framework and corresponding data-structures, the general 
algorithm of the derivational process can be specified as follows. 


initialize a process-strip 
repeat the following steps: 
saturate the process-strip 
look for completing statements 
if there is no statement to be applied 
return the process-strip 
else: 
select a completing statement 
apply it to the process-strip 


After initialisation, the process-strip is populated with new components and 
saturated repeatedly, till there is no admissible component available. This 
brings the process of derivation to an end. 


5.2.1 Initialisation 


The process of initialisation is implemented by the function initializeO. 
The initial process-strip contains an empty slice. 


>>> from pStrips import PStrips 
>>> pStrip = PStrips() 

>>> pStrip.get list of Slices() 
[] 


The empty pStrip needs to be populated with some components. At this mo- 
ment the meaning-expressions and a user become relevant. The user must be 
able to express her/his intention (vivaksz) by interacting with the system. Sup- 
pose, for example, the user wants to express the sentence balakah pathati (a boy 
recites). Then the choice of the components bAlaka_k and paTha_a become 
imminent.? The following statement is chosen for application. 


>>> st01 = Statements ( 
'E ADD y * 


3 For the sake of simplicity and space, I will continue with the derivation of the verbal 
conjugation only. 
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ADD y & paTha a * 

yM:[vyaktAyAM x vAci x] * 

ST TYPE:[COMPLETING] * ') 
>>> stOi.get signature() 

'E ADD y. ADD y. yM' 


The signature of the above statement is significant for choosing the appro- 
priate signature-functions. These are required to execute some statement. 
Consider the following function for the application of statements with signa- 
ture: E ADD y ADD y... yM. 


def E ADD y. ADD y. yM(pStrip,statement): 
(op type,op val) = statement.get operation part () 
list of sets-Elements().get component in langComp form(op val) 
langComp = LangComps(list of sets) 
sentence = Sentences ( 

([langCompl,statement.get vals for condition type('yM'))) 

dState = DS8tates((sentence,statement.get statement string())) 
slice = Slices([dState]) 
pStrip = PStrips([slice]) 
return pStrip 


[1] The function takes pStrip and statement objects and returns the updated 
pStrip after application of the statement. [2] The values of operation-type 
and operation-value are stored in the variables op_type and op_val respec- 
tively. These are 'ADD_y' and 'paTha_a'. [3] The variable list of sets con- 
tains the list of sets of IDs corresponding to the op_val which in this case is 
'paTha a'. [4] The langComp is instantiated, followed by [5] sentence, [7] 
dState, [8] slice and [9] pStrip. [10] The updated pStrip is returned. 


The application of the above signature function results in the introduction of 
a new slice within the process-strip. Within this slice a new derivational state 
is added which records the changes in the sentence and saves the statement 
that is applied as well. The results are as follows. 


>>> from signatureFunctions import E ADD y ADD y  yM 
>>> pStrip = E ADD y ADD y. yM(pStrip,stO1) 
>>> print pStrip 
p : p.O,ait 9,paTha a,udAttet 9 * 
a : hrasva O,a O,ait 9,paTha a,udAttet 9 * 
Th : Th 1,ait 9,paTha a,udAttet 9 
:-: E ADD y * ADD y & paTha a * 
yM:[vyaktAyAM x vAci x] * 
ST_TYPE: [COMPLETING] * 
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5.2.2 Saturation 


At this stage the process of saturation is carried out. Consider the following 
statement. 


>>> st02 = Statements ( 
'Xm_ATT_a * 
ATT a & at 1 * 
Xm:[a O][hrasva O] AND Xm NOT:[at 1] * 
ST TYPE:[STABILIZING] +* 
A RULES: [a 11070]') 


This statement attaches the attribute at 1 to a sound-set Xm that fulfils the 
following two conditions: 


1. Xm: [a, 0] [hrasva, 0] i.e. the sound-set Xm must contain the phoneme a, 0 
and the attribute hrasva_0. 


2. Xn. NOT: [at 1] implies that the attribute should not already be present in 
the said sound-set. This is necessary to avoid recursive attachment of an 
attribute. 


The signature of this statement is as follows. 


>>> stO2.get signature() 
'Xm ATT a. ATT a, Xm Xm NOT' 


The implementation of this statement is effected through the following 
signature-function. 


def Xm ATT a. ATT a. Xm Xm NOT(pStrip,statement): 
sentence = get a deepcopy of sentence(pStrip) 
for langComp in sentence.sentence: 
for soundSet in langComp.langComp: 
chk results dict = Xm ATT a ATT a. Xm Xm NOT CHECK( 
soundSet,statement) 
if chk results dict.get('APPLICABILITY'): 
soundSet.addAttributes( 
set([chk results dict.get('ATT a')]1)) 
pStrip.get last Slice().extend Slice( 
DStates((sentence,statement.get statement string()))) 
return pStrip 


[1] Again the function takes up the pStrip and a statement and [12] returns 
the updated strip after the application of the statement. [2] A deep copy of 
the sentence is needed to avoid over writing. [3-4] Since the operation is ex- 
ecuted at the level of sound-sets, the two £or loops are carried out. [5-6] The 
function _...__CHECK checks the conditions whether the statement is appli- 
cable to the soundSet or not. The results of this check function are stored in the 
dictionary chk results dict. [7] If applicable, then [8-9] the appropriate 
attribute is added to that soundSet, and [10-11] pStrip gets updated. 
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The above function uses another function to check the conditions. 


def Xm ATT a ATT a Xm Xm NOT CHECK(soundSet,statement): 
chk results dict = {} 
oper part key, oper part val = statement.get operation part () 
if CHK Xm Xm NOT(soundSet,statement): 
chk results dict['APPLICABILITY'] = True 
chk results dict[oper part key] = oper part val 
return chk results dict 


[1] The ... CHECK function takes a soundSet and a statement and re- 
turns chk results dict a dictionary of results. [4] It uses another function 
CHK Xm Xm NOT that checks whether the Xm and Xm NOT conditions are ful- 
filled. 


The statement st02 can now be applied. 


>>> from signatureFunctions import Xm ATT a  ATT a, Xm Xm NOT 
>>> pStrip = Xm ATT a ATT à Xm Xm NOT(pStrip,stO2) 
>>> print pStrip 
p : p.O,ait 9,paTha a,udAttet 9 * 
a : hrasva O,a O,ait 9,paTha a,udAttet 9 * 
Th : Th, 1,ait 9,paTha a,udAttet 9 
:-: E ADD y * ADD y & paTha a * 
yM:[vyaktAyAM x vAci x] * 
ST TYPE:[COMPLETING] * 


p : p.O,ait 9,paTha a,udAttet 9 * 
a : hrasva O,a O,ait 9,at 1,paTha a,udAttet 9 * 
Th : Th 1,ait 9,paTha a,udAttet 9 
:-: Xm ATT a * ATT_a & at 1 * 
Xm:[a O][hrasva O] AND Xm NOT:[at 1] * 
ST TYPE:[STABILIZING] * A RULES: [a 11070] 


The application of the above statement has resulted in an extension of the 
slice. [10] A new derivational state is added. [12] The effect of this function 
is visible in this line, where the sound-set gets the attribute at. 1 added to it. 
This is the only sound-set where the conditions of the statement are fulfilled. 


It should be noted that multiple application of this statement to the same 
pStrip does not result in any further changes. 


>>> pStrip = Xm ATT a ATT à Xm Xm NOT(pStrip,stO2) 
>>> pStrip = Xm ATT a ATT à Xm Xm NOT(pStrip,stO2) 
>>> print pStrip 
p : p.O,ait 9,paTha a,udAttet 9 * 
a : hrasva O,a O,ait 9,paTha a,udAttet 9 * 
Th : Th, 1,ait 9,paTha a,udAttet 9 
:-: E ADD y * ADD y & paTha a * 
yM:[vyaktAyAM x vAci x] * 
ST TYPE:[COMPLETING] * 


p : p.O,ait 9,paTha a,udAttet 9 * 
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a : hrasva_0,a_0,ait_9,at_1,paTha_a,udAttet_9 * 
Th : Th_1,ait_9,paTha_a,udAttet_9 
:-: Xm ATT a * ATT_a & at 1 * 
Xm:[a Ol]l[hrasva O0] AND Xm NOT:[at 1] * 
ST TYPE:[STABILIZING] * A RULES: [a 11070] 


Consider now the following two statements. 


>>> st03 = Statements( 

'Xm_ATT_a * 

ATT_a & vRddhi O * 

Xm:[At 2,aic 0] AND Xm NOT:[vRddhi O] * 

ST TYPE:[STABILIZING] * A RULES: [a 11001] ') 
>>> st04 = Statements ( 

'Xm_ATT_a * 

ATT a & guNa O * 

Xm:[at 1,eG 0] AND Xm NOT:[guNa 0] * 

ST TYPE:[STABILIZING] * A RULES: [a 11002]') 


Both of them have the same signature as the previous statement. 


>>> stO3.get signature() 

'Xm ATT a. ATT à, Xm Xm NOT" 
>>> stO4.get signature() 

"Em ATT à. ATT a, Xm, Xm, NOT' 


This implies that both of them can be applied by using the same signature 
function. The execution of st03 shows that it is not applied at all. This is ob- 
vious as none of the sound-sets fulfill the required conditions. The pStrip 
remains as it is. This is evident from the output below. 


>>> pStrip = Xm ATT a ATT à Xm Xm NOT(pStrip,stO03) 
>>> print pStrip 
p : p_0,ait_9,paTha_a,udAttet_9 * 
a : hrasva_0,a_0,ait_9,paTha_a,udAttet_9 * 
Th : Th_1,ait_9,paTha_a,udAttet_9 
:-: E ADD y * ADD y & paTha a * 
yM:[vyaktAyAM x vAci x] * 
ST_TYPE: [COMPLETING] * 


p : p.O,ait 9,paTha a,udAttet 9 * 
a : hrasva O,a O,ait 9,at 1,paTha a,udAttet 9 * 
Th : Th, 1,ait 9,paTha a,udAttet 9 
:-: Xm ATT a * ATT_a & at 1 * 
Xm:[a O][hrasva O] AND Xm NOT:[at 1] * 
ST TYPE:[STABILIZING] * A RULES: [a 11070] 


The execution of st04 however brings out some changes. 


>>> pStrip = Xm ATT a ATT à Xm Xm NOT(pStrip,stO4) 
>>> print pStrip 

p : p.O,ait 9,paTha a,udAttet 9 * 

a : hrasva O,a O,ait 9,paTha a,udAttet 9 * 
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Th : Th_1,ait_9,paTha_a,udAttet_9 
:-: E ADD y * ADD y & paTha_a * 
yM:[vyaktAyAM x vAci x] * 
ST TYPE:[COMPLETING] * 


p : p.0O,ait 9,paTha a,udAttet 9 * 
a : hrasva O,a O,ait 9,at 1,paTha a,udAttet 9 * 
Th : Th 1,ait 9,paTha a,udAttet 9 
:-: Xm ATT a * ATT_a & at 1 * 
Xm:[a O][hrasva O] AND Xm NOT:[at 1] * 
ST TYPE:[STABILIZING] * A RULES: [a 11070] 


p.0,ait 9,paTha a,udAttet 9 * 
: hrasva 0,a 0,ait 9,at 1,guNa O,paTha a,udAttet 9 * 
Th : Th, 1,ait 9,paTha a,udAttet 9 
:-: Xm ATT a * ATT a & guNa O * 
Xm:[at 1,eG 0] AND Xm NOT:[guNa 0] * 
ST TYPE:[STABILIZING] * A RULES: [a 11002] 


mro 


[18] The attribute guNa 0 is now attached to the sound-set having the at- 
tribute at. 1. There is a hierarchy among the saturating statements that is de- 
cided by the dependency of their conditions together with the operational 
attachment. Thus, while st03 and st04 can be applied in any order, st04 can 
only be applied after the application of st02. 


The next statement attaches an attribute not only to just one sound-set, but to 
several of them. 


>>> st05 = Statements( 
'X_ATT_a * 
ATT_a & bhvAdi_O * 
X:[bhU a,paTha a,ji a,pUG a,dheT a] AND X NOT:[bhvAdi O] * 
ST TYPE:[STABILIZING] * ') 
>>> stO5.get signature() 
'X ATTa ATTa X X NOT" 


This statement attaches the group name bhvAdi 0 for the bhvddi-gana of the 
verbal roots. Here, for the sake of readability, I have not listed all the roots that 
are mentioned in this group. 


>>> from signatureFunctions import X ATT a. ATTa X X NOT 
>>> pStrip = X ATT a, ATT a X X NOT(pStrip,stO5) 
>>> print pStrip 


p_0,ait_9,bhvAdi_0O,paTha_a,udAttet_9 * 
hrasva 0,a O0,ait 9,at 1,bhvAdi O,guNa O,paTha a, 
udAttet 9 * 
Th : Th 1,ait 9,bhvAdi O,paTha a,udAttet 9 
:-: X ATT à * ATT à & bhvAdi O * 
X:[bhU a,paTha a,ji a,pUG a,dheT a] AND X NOT:[bhvAdi O0] * 
ST TYPE:[STABILIZING] * 


mro 
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The attribute bhvAdi_0 is added to all three of the sound-sets that constitute 
the component paTha_a. In the above output, I have deleted the earlier deriva- 
tional states to save space and to aid readability. 


The next statement that attaches the attribute dhAtu_0 is similar. 


>>> st06 = Statements ( 
'X_ATT_a * 
ATT_a & dhAtu_O * 
X:[bhvAdi O,adAdi O,juhotyAdi O,divAdi O,svAdi O, 
tudAdi O,rudhAdi O,tanAdi O,kryAdi O,curAdi O0] AND 
X NOT:[dhAtu 0] * 
ST TYPE:[STABILIZING] * A RULES: [a, 13001] ') 
>>> st06.get_signature () 
UX ATT a. ATT a- -X X NOT’ 
>>> from signatureFunctions import X_ATT_a__ATT_a__X_X_NOT 
>>> pStrip = X_ATT_a__ATT_a__X_X_NOT(pStrip,st06) 
>>> print pStrip 


p : p.O,ait 9,bhvAdi O,dhAtu O,paTha a,udAttet 9 * 

a : hrasva O,a O,ait 9,at i,bhvAdi O,dhAtu O,guNa O, 
paTha a,udAttet 9 * 

Th : Th 1,ait 9,bhvAdi O,dhAtu O,paTha a,udAttet 9 

:-: X ATT à * ATT à & dhAtu O * 
X:[bhvAdi O,adAdi O,juhotyAdi O,divAdi O,svAdi O, 
tudAdi O,rudhAdi O,tanAdi O,kryAdi O,curAdi O0] AND 

X NOT:[dhAtu O0] * 
ST TYPE:[STABILIZING] * A RULES:[a, 13001] 


5.2.3 Completion 


The next statement introduces the component 1aT 0. This component is 
added after the language-component Xi. The semantic condition for its ad- 
dition is stated in the condition yM: [vartamAna x]. This condition is satisfied 
once the user confirms her or his intention to express vartamAna x or present 
tense. 
>>> stO7 = Statements( 

'Xi_ADD_y * 

ADD y & laT_O * 

Xi:[dhAtu 0] AND Xj_NOT:[lakAra_9] AND yM:[vartamAna x] * 

ST_TYPE: [COMPLETING] * A RULES:[a 32123]') 
>>> stOT7T.get signature() 

'Xi ADD y. ADD y. Xi Xj NOT, yM' 


The above statement is applied through the corresponding signature function. 
It is a completing statement and therefore a new slice is added. Line [16] notes 
the slice boundary. 


v9. 00 doo] Hi 0 hd eH 


[220 2020 
do 07i FP WNF c 


v 00 do 0i FE WN KH 


Re 
m O 


5.2 Processes of grammar 125 


>>> from signatureFunctions import Xi_ADD_y__ADD_y__Xi_Xj_NOT_yM 
>>> pStrip = Xi_ADD_y__ADD_y__Xi_Xj_NOT_yM(pStrip,st07) 
>>> print pStrip 


: p.O,ait 9,bhvAdi O,dhAtu O,paTha a,udAttet 9 * 
a : hrasva O,a O,ait 9,at 1,bhvAdi O,dhAtu O,guNa O, 
paTha a,udAttet 9 * 
Th : Th 1,ait 9,bhvAdi O,dhAtu O,paTha a,udAttet 9 
:-: X ATT a * 
ATT à & dhAtu O * 
X:[bhvAdi O,adAdi O,juhotyAdi O,divAdi O,svAdi O, 
tudAdi O,rudhAdi O,tanAdi O,kryAdi O,curAdi 0] AND 
X NOT:[dhAtu 0] * 
ST TYPE:[STABILIZING] * A RULES:[a, 13001] 
p.0,ait 9,bhvAdi O,dhAtu O,paTha a,udAttet 9 * 
hrasva 0,a O0,ait 9,at 1,bhvAdi O,dhAtu O,guNa O, 
paTha a,udAttet 9 * 
Th : Th 1,ait 9,bhvAdi O,dhAtu O,paTha a,udAttet 9 * 
l1 : 10,Tit 9,ait 9,1aT O,lakAra 9 
:-: Xi ADD y * 
ADD y & laT_O * 
Xi:[dhAtu 0] AND Xj NOT:[1akAra 9] AND yM:[vartamAna x] * 
MEXGRP 0001 * A RULES:[a. 32123] 


"o 


mro 


[21] The above statement also extends the language-component and a new 
sound-set for laT 0 is added. laT_0 is a Tit 9 and ait_9 lakAra, 9 and these 
attributes are already specified in the database. 


The next statement specifies whether the components within a particular 
language-component form a sentence in active, passive or middle voice. 
Again, the decision to employ active voice is reached on the basis of the se- 
mantic condition, which the user must address directly. 


>>> st08 = Statements ( 
'X_ATT_a * 
ATT_a & xkartR_9 * 
X:[dhAtu 0] AND X NOT:[xkarman 9][xbhAva 9][xkartR 9] AND 
X M:[kartRprayoga x] * 
ST TYPE:[STABILIZING] *') 
>>> st08.get signature() 
'X ATT a, ATT a X X M X NOT' 


>>> pStrip = X ATT a, ATTa X X M X NOT(pStrip,stO8) 
>>> print pStrip 


p : p_0,ait_9,bhvAdi_0O,dhAtu_0O,paTha_a,udAttet_9,xkartR_9 * 
a : hrasva O,a O,ait 9,at 1,bhvAdi O,dhAtu O,guNa O, 
paTha a,udAttet 9,xkartR 9 * 
Th : Th 1,ait 9,bhvAdi O,dhAtu O,paTha a,udAttet 9, 
xkartR 9 x 
1 : 10,Tit 9,ait 9,1aT O,lakAra 9 
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:-: X_ATT_a * 
ATT_a & xkartR 9 * 
X:[dhAtu 0] AND X NOT:[xkarman 9][xbhAva 9][xkartR 9] AND 
X M:[kartRprayoga x] * 
ST_TYPE: [STABILIZING] * 


The following statement attaches the attribute parasmaipada O to the 
language-component. 


>>> st09 = Statements ( 
'X_ATT_a * 
ATT_a & parasmaipada O * 
X:[dhAtu Ol[xkartR 9] AND X NOT:[Atmanepada O0] * 
ST TYPE:[STABILIZING] *') 
>>> stO9.get signature() 
'X ATT a ATTa X X NOT' 
>>> from signatureFunctions import X ATT a. ATT a X X NOT 
>>> pStrip = X ATT a ATT a X X NOT(pStrip,stO9) 
>>> print pStrip 


p : p.O,ait 9,bhvAdi O,dhAtu O,paTha a,parasmaipada O, 
udAttet 9,xkartR 9 * 

a : hrasva O,a O,ait 9,at 1,bhvAdi O,dhAtu O,guNa O, 
paTha a,parasmaipada O,udAttet 9,xkartR 9 * 

Th : Th 1,ait 9,bhvAdi O,dhAtu O,paTha a,parasmaipada O0, 
udAttet 9,xkartR 9 * 

l : 10,Tit 9,ait 9,1aT O,lakAra 9 

:-: X ATT a * 

ATT à & parasmaipada O * 
X:[dhAtu O][xkartR 9] AND X NOT:[Atmanepada 0] * 
ST_TYPE: [STABILIZING] * 


The following statement provides for the substitution of 1aT 0 by the third 
person singular suffix tip O0. 


>>> st10 = Statements( 

'Xi_REP_y * 

REP y & tip O * 

Xh:[dhAtu O][parasmaipada O] AND Xi:[lakAra_9] AND 

yM:[prathama Ol[ekavacana 0] * 

ST TYPE:[COMPLETING] * A RULES: [a 34077][a. 34078] ') 
»»» ST10 obj.get signature() 

'Xi REP y. REP y. Xh Xi yM' 


Again, the decision to opt for the third person and singular is taken by the 
user on the basis of the semantic conditions prathama, 0 and ekavacana 0 
respectively. 


>>> from signatureFunctions import Xi REP y REP y. Xh Xi yM 
>>> pStrip = Xi REP y REP y  Xh Xi yM(pStrip,sti10) 
>>> print pStrip 
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p : p.O,ait 9,bhvAdi O,dhAtu O,paTha a,parasmaipada O, 
udAttet 9,xkartR 9 * 
a : hrasva O,a O,ait 9,at 1,bhvAdi O,dhAtu O,guNa O, 


paTha a,parasmaipada O,udAttet 9,xkartR 9 * 
Th : Th, 1,ait 9,bhvAdi O,dhAtu O,paTha a,parasmaipada O, 
udAttet 9,xkartR 9 * 


l1 : 1 O0,REPLACED 9,Tit 9,ait 9,laT O,lakAra 9 * 
t : t O,pit 9,tip O * 

i : hrasva O,i O,pit 9,tip O 

:-: Xi REP y * 


REP y & tip O * 

Xh:[dhAtu Ol[parasmaipada O] AND Xi:[1akAra 9] AND 
yM:[prathama Ol[ekavacana 0] * 

ST TYPE:[COMPLETING] * A RULES: [a, 34077] [a, 34078] 


Replacement is implemented by attaching the attribute REPLACED 9 to those 
parts that are replaced (line [11]) and adding the replacement components at 
the appropriate index (line [12-13] ). 


The following statement attaches the attribute tiG O to the third person sin- 
gular suffix tip O0. 


>>> st11 = Statements( 
'X ATT a * 
ATT a & tiG O * 
X:[tip O,tas O0,jhi O,sip O,thas O,tha O,mip O,vas O,mas O0, 
ta O,AtAm O,jha O,thAs 1,AthAm O,dhvam O,iT 1,vahi O,mahiG 0] 
AND 
X NOT:[tiG O] * 
ST_TYPE: [STABILIZING] + A RULES: [a 34078] [a 11071]') 
>>> stii.get signature() 
'X ATTa ATTa X X NOT' 
>>> from signatureFunctions import X ATT a | ATTa X X NOT 
>>> pStrip = X ATT a ATT a X X NOT(pStrip,sti11) 
>>> print pStrip 


p : p.O,ait 9,bhvAdi O,dhAtu O,paTha a,parasmaipada O, 
udAttet 9,xkartR 9 * 
a : hrasva O,a O,ait 9,at i,bhvAdi O,dhAtu O,guNa O, 


paTha a,parasmaipada O,udAttet 9,xkartR 9 * 
Th : Th 1,ait 9,bhvAdi O,dhAtu O,paTha a,parasmaipada O0, 
udAttet 9,xkartR 9 * 
l1 : 1 O,REPLACED 9,Tit 9,ait 9,laT O,lakAra 9 * 
: t O,pit 9,tiG O,tip O * 
: hrasva O,i O,pit 9,tiG O,tip O 
:-: X ATT a * 
ATT a & tiG O * 
X:[tip O,tas O,jhi O,sip O,thas O,tha O,mip O,vas O,mas O0, 
ta O,AtAm O,jha O,thAs 1,AthAm O,dhvam O,iT 1,vahi O, 
mahiG 0] AND 
X NOT:[tiG 0] * 
ST TYPE:[STABILIZING] * A RULES:[a 34078][a 11071] 
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To a tiG_O or a zit_9 component, the attribute sArvadhAtuka O0 is attached 
by the following statement. 


>>> sti2 = Statements ( 
‘XATT à + 
ATT_a & sArvadhAtuka O x 
X:[tiG O,zit 9] AND X NOT:[sArvadhAtuka O0] * 
ST TYPE:[STABILIZING] * A RULES: [a, 34113] ') 
>>> sti2.get signature() 
"XATT A ATT ao. A X NOT" 


The application of this statement is similar to the other attachments of the 
attributes. 


>>> from signatureFunctions import X_ATT_a__ATT_a__X_X_NOT 
>>> pStrip = X ATT à, ATT a X X NOT(pStrip,sti12) 
>>> print pStrip 


p : p.O,ait 9,bhvAdi O,dhAtu O,paTha a,parasmaipada O, 


udAttet 9,xkartR 9 * 
a : hrasva O,a O,ait 9,at 1,bhvAdi O,dhAtu O,guNa O, 


paTha a,parasmaipada O,udAttet 9,xkartR 9 * 
Th : Th 1,ait 9,bhvAdi O,dhAtu O,paTha a,parasmaipada O0, 
udAttet 9,xkartR 9 * 
l1 : 1 0,REPLACED 9,Tit 9,ait 9,laT O,lakAra 9 * 
: t O,pit 9,sArvadhAtuka O,tiG O,tip O * 
: hrasva O,i O,pit 9,sArvadhAtuka O,tiG O,tip O 
:-: X ATT a * 
ATT à & sArvadhAtuka O * 
X:[tiG O,zit 9] AND X NOT:[sArvadhAtuka 0] * 
ST TYPE:[STABILIZING] * A RULES: [a, 34113] 


Finally, the infix zap 0 is introduced by the following statement. 


>>> st13 = Statements( 
'Xi ADD y * 
ADD y & zap O * 
Xi:[dhAtu 0] AND Xj:[sArvadhAtuka 0] AND Xj M:[kartR O] * * 
A RULES: [a 31068] ') 
>>> sti3.get signature() 
'Xi ADD y. ADD y. Xi Xj Xj M' 


The conditions also provide the index where the new components should 
properly be added. 


>>> from signatureFunctions import Xi ADD y ADD y. Xi Xj Xj M 
>>> pStrip = Xi ADD y. ADD y Xi Xj Xj M(pStrip,sti3) 
>>> print pStrip 


p : p.O,ait 9,bhvAdi O,dhAtu O,paTha a,parasmaipada O, 
udAttet 9,xkartR 9 * 
a : hrasva O,a O,ait 9,at i,bhvAdi O,dhAtu O,guNa O, 
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paTha a,parasmaipada O,udAttet 9,xkartR 9 * 
Th : Th 1,ait 9,bhvAdi O,dhAtu O,paTha a,parasmaipada O0, 
udAttet 9,xkartR 9 * 
hrasva 0,a O,pit 9,zap O,zit 9 * 
l O,REPLACED 9,Tit 9,ait 9,1aT O,lakAra 9 * 
t O,pit 9,sArvadhAtuka O,tiG O,tip O * 
: hrasva O,i O,pit 9,sArvadhAtuka O,tiG O,tip O 
:-: Xi ADD y * 
ADD y & zap O * 
Xi:[dhAtu 0] AND Xj:[sArvadhAtuka 0] AND Xj M:[kartR O] * * 
A RULES:[a, 31068] 


| cnp» 


The above steps demonstrate the dynamics of the process of derivation. At 
each step, a larger number of characterising statements are employed. The 
main algorithm of the derivational process is summarized in the following 
main function. 


def execute(): 
pStrip = initialize() 
while 1: 
pStrip = saturate (pStrip) 
possible statements = interpret (pStrip) 
if len(possible_statements) == 0: 
return pStrip 
else: 
statement = select(pStrip,possible statements) 
pStrip = apply(pStrip,statement) 


[1] The main function which returns a process-strip. [2] The function 
initialize() initializes the process-strip. [3] The third line specifies a loop. 
[4] At this stage, a given process-strip is saturated, i.e. attributes are attached 
to the language-components or sound-sets. [5] Once the process-strip is satu- 
rated, it is tested for any possible completing statement that could be applied. 
The list of all such candidates is stored in possible_statements. [6] In case 
the list is empty, i.e. there is no statement that may be applied, [7] then the 
process-strip is returned. Otherwise, [9] one statement is selected, depending 
upon the semantic considerations and the intention (vivaksa) of the user. [10] 
Finally, that statement is applied and the process-strip is updated. 


Appendix A 
The Systematic Approach 


In this appendix I intend to work out the systematic approach followed by 
the ancillary disciplines associated with the Vedas.! In particular, I examine 
the Rgvedapratisakhya? the Baudhayanaéóulbasütra? and the Astadhyayr.* 
My emphasis is to look into the formal aspects of their basic techniques. 
The chief outcome is the certainty that they follow a common methodology. 
Further, it leads to the determination and specification of the details of this 
systematic approach. The results are of significance as they provide the basis 
for a formal perspective on the Astadhyayi of Panini. 


In the following, I briefly mention a conceptual plan for the subsequent 
investigations, including the new terms introduced by me. I shall use 
typewriter font whenever I want to emphasize their terminological char- 
acter. These terms are needed to formulate the details of the systematic 
approach. They reflect the common character of the content and processes 
of these disciplines, and are needed to avoid the imposition of specific terms 
from any particular tradition with their established denotations for the 
general concepts. Their introduction is all the more necessary as the present 
study aims to point out features spanning across several disciplines. 


The starting point is the observation that the above mentioned texts 
aim towards retention of a given phenomenon. By a given phenomenon I 


! The ancillary disciplines or the Vedangas (lit. limbs of the Vedas) are: Siksa (phonetics), 
Chandas (prosody), Vyakarana (grammar), Nirukta (etymology), Kalpa (instructions on ritual 
practice) and Jyotisa (astronomy). See (Gonda 1975 p. 34). For a summary of the literature 
on phonetics and grammar, see (Scharfe 1977), on ritual practices (Gonda 1977) and for 
astronomy, see (Pingree 1981). 

? For our study, I use the Rgvedapratióakhya with the commentary of Uvata edited by Viren- 
drakumar Varma (2007) and occasionally an earlier edition by Mangal Deva Shastri (1959). 
3 For the purpose of this study, I am primarily using the edition of S. N. Sen and A. K. Bag 
(1983). 

^ For editions of the Astadhyayi used by me, see p. 145. 
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mean any existing linguistic or cultural practice established over a number 
of generations. It is something which one has received as the standard and 
would like to protect and pass on intact to the next generation, for example, 
the recitation of the Vedic mantras. 


In order to achieve the goal of retention, these texts follow a common 
systematic approach which consists of two interdependent and complemen- 
tary processes. The first one is an analysis of a given whole into constituent 
components and fundamental units. The other one is synthesis through 
rule-based combination of components and units to regain the given whole. 


A basic operation which is needed for these processes is characterization of 
components and units. This involves attaching attributes to them to impart 
an identity and/or associating some information which they subsequently 
bear. The operations of characterization and combination are executed once 
certain conditions are satisfied. 


The apparent cyclical exercise of first analyzing and then synthesizing 
comprehends the given phenomena in a systematic manner. This gives rise 
to an interconnected structure of components and units together with the 
conditioned operations. Such structures have the tendency to last longer and 
are explained on the basis of the underlying system. Structures facilitate 
variations and change. 


In the following, I intend to demonstrate this systematic approach by 
means of examples to determine its chief features. 


A.1 Analysis and synthesis 


To begin with, we examine how the above mentioned texts perform the com- 
plementary processes of analysis and synthesis for the sake of comprehension 
of a given phenomenon. 


A.1.1 Syllables (Aksaras) 


The first example we take up is the process of analyzing a given utterance into 
constituent syllables. Consider the following mantra of the Rgveda : 


aR, | (RV. 3.62.10). 
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Itis the given whole which can be analyzed into components. The components 
in this case are syllables (aksaras). The Rgvedapratisakhya provides rules for 
analysis of a given utterance into constituting syllables. These are given as 
follows: 


1. Both the short as well as long vowels (svaras) form a syllable.” 


2. The nasal sound (anusvara) and consonants (vyafijana) are part of the syl- 
lable.® 


3. The consonants that are in between two vowels are part of the latter 
vowel.” 


4. The nasal sound and aspirated sound (visarjaniya) form part of the pre- 
ceding vowel.® 


5. The first phoneme of a consonant conjunct (samyoga) if in between two 
vowels, optionally forms a part of the first vowel.? 


An application of the above rules yields the following analysis. The separa- 
tions are shown by the danda (1) sign. 


ALIS RIA IT (umi 


It should be noted that the components are dependent upon the given whole 
and in this sense, do not have an independent existence. They have relevance 
only as long as they can be combined to form the original. 


The results of analysis are not unique. This is because there are more 
than one possible options to dissect a given whole. Another variation of the 
above analysis could be: 


qicifiigidixiumi 


The process of synthesis in this case is simple. It consists of only one rule, that 
of uttering or placing the analyzed components one after another, without any 
pause or space in between, while following the original sequential order. Both 
the processes of analysis and synthesis are rule-based. The rules of analysis are 
mentioned explicitly while the rules for synthesis are understood implicitly.!? 


5 gv cast | (RVPr. 1.19). 

6 ARAN eere WANA, | (RVPr. 1.22). 

7 SIR AMIGA | (RVPr. 1.23). 

8 GaAs | (RVPr. 1.24). 

9 gafas | (RVPr. 1.25). 

10 The rules to determine the syllables in the Vajasaneyipratisakhya are more detailed. Apart 
from the above mentioned cases, it also include instances when the consonants result from 
reduplication. For example, parssvyam (VS. 25.5) consists of pars | svyam where the first 


reduplicated consonant ś belongs to the former syllable and the second to the latter. See 
(VPr. 1.99-106). 
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A.1.2 The falcon shaped fire altar (Syenacit) 


We now consider an example from the Baudhayanaéulbasütra. The Sulba- 
sütras were primarily meant for piling-up of the fire altar (agnicayana) and 
preparation of the sacrificial arena.!! Accordingly, these are usually men- 
tioned as part of the general instructions for performance of the Vedic rituals 
($rautasütras).? The Baudhayanaáulbasütra describes the construction of 
the sacrificial ground (mahavedi), provides for the distance, relative positions 
and shape of the pits for the three fires—the garhapatya, the ahavaniya and the 
daksinagni as well as various altars (vedis), including piling-up of the bricks 
for fire altars (citis). 


Consider the syenacit or the fire altar in the shape of falcon with curved 
wings and extended tails.'? 


It is an altar having the above form and consisting of 1000 bricks piled in five 
layers, with 200 bricks in each layer. The entire area of the altar should be 
75 times the area of a square with length equal to the height of the sacrificer 
(yajamana).'* From a systematic point of view, this syenacit altar is the given 
whole which is to be retained. The Sulba-sütra enunciates its components. The 
figure is first divided in four parts: the main core (atman), the head (siras), the 
tail (puccha) and the two wings (paksa). 


11 On the terms śulba (or áuloa) and Sulbasütra see (Michaels 1978 p. 162-165). For an intro- 
duction to piling up of the fire altar (agnicayana), see (Michaels 1978 p. 36-43). Frits Staal 
(1983) provides a more detailed description of the ritual processes. 

12 There are Sulbasütras ascribed to: Baudhayana, Apastamba, Vadhüla (belonging to the 
Taittirtyasamhita of the Krsnayajurveda), Manu and Varaha (Maitrayanisamhita of the 
Krsnayajurveda) as well as Katyayana (Vajasaneyisamhita of the Suklayajurveda). Their 
relative as well as absolute chronology is unclear (Michaels 1978 p. 51-57). Scholars, how- 
ever, place them between 5th to 2nd century BCE, with Baudhayana$ulbasütra being one 
of the earliest (Pingree 1981 p. 4-5). 

13 This is mentioned in the tenth chapter (BSuS. 10.1-20). Here I follow the number according 
to the edition by S. N. Sen and A. K. Bag (1983). 

14 See, for example, (BSuS. 10.4-9). 
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The analysis is not unique. It could have been analyzed into a different set 
of components. Moreover, the analyzed components are dependent upon 
the given whole and lack independent existence or usage in the sacrificial 
activities. 


The analyzed components are further dissected into five different shapes of 
smaller components that correspond to the five types of bricks. 


1. The first type (B1) are square bricks having the size of one-fourth of a 
purusa. They are therefore called caturthi bricks. 


2. The second type (B2) are triangular bricks obtained by cutting the caturtht 
bricks (B1) along the diagonal. These are called ardha bricks as they have 
half the area of the B1 square bricks. 


3. The third type (B3) are again triangular bricks obtained by cutting B1 
across the diagonals. These are termed pādyā bricks having one-quarter 
of the area of B1. 


4. The fourth type (B4) are four-sided quarter bricks formed by adding 
an isosceles triangle of two equal sides with length 1 pada and the hy- 
potenuse with length v2 pada combined with a rectangle with along the 
length having measurements of length 1 pada and breadth 14 pada. 


5. The fifth type (B5) are bricks obtained by joining two B4 bricks along 
their longest side. Its shape is like the beak of a swan, hence it is called 
hamsamukhi. 


The components in this case are evident and are literally the building blocks 
of the given whole. The components depend upon the original. They do not 
have independent employment or usage in the sacrificial rituals. Further, the 
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components are also not unique. There can be a different set of components, 
constituting the same whole. 


The second set of components, namely the caturthi-bricks etc. are parts 
of bigger components like the main core (aütman) or the head (Siras) etc. 
Although more granular than the first set, the components belonging to the 
second set are formally speaking not different from the first.!° 


The process of synthesis is rule based. The nature of rules is placing the 
components in a manner so that the original form is regained. These are 
specified, for example, in (BSuS. 10.10-20). 


A.1.3 Samhita-patha and Pada-patha 


The main function of Prati$akhyas is to provide for the synthesis of continu- 
ous recitation (samhita-patha) from the word-for-word recitation (pada-patha).! 


An application of the process of analysis to the continuous recitations 
(samhita-patha)" of the mantras of the Rgveda, yields individual padas or 
words. Their sequential enunciation is called word-for-word recitation (pada- 
patha).'® The given whole in this case, therefore, is the continuous recitation 
of any mantra of the Rgveda and their analyzed elements i.e. the padas are its 
components. 


The task of analyzing a given continuous expression into individual 
padas is largely a heuristic process and there are no rules following which 


15 See below the discussion in sectionA.2. 

16 The name Pratióakhya indicates that they correspond to a particular branch (Sakha) 
of the Veda. Among the available ones, the Rgvedapratisakhya (attributed to Saunaka), 
Taittirlyapratisakhya, Vajasaneyipratisakhya and Rktantra are the most important. The 
chronology of these texts is not setteled. 

17 The definition of samhita in Rgvedapratióakhya indicates the process of combining the 
components: qareareaqiaf: HRA Ad A store! (RVPr. 2.2). [Samhita is that which com- 
bines the final parts of the padas with the initial parts of the following ones, without the 
intermission of time.] On the other hand, the Vajasaneyipratisakhya does not use the com- 
ponents padas but defines it on the basis of continuous recitation of the phonemes. SUA, 
THAT: dear (VPr. 1.158). [Samhita is the conjunction of phonemes uttered in one breath.] 
18 The collections are respectively called the Samhita-patha (1933) and the Pada-patha 
(1947). Vajasaneyipratisakhya terms the analyzed position as asamhita in contradistinction 
to the combined state as samhita. qui: dfa: ı qast: | (VPr. 1.155-156). See (Varma 
1987 p. 90-92). 
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the individual components can be identified.” Their recognition is possible 
as they are meaningful components that are used interchangeably in the 
recitations.” It is facilitated by the process of concurrent presence (anvaya) 
and concurrent absence (vyatireka)?! As an example, we take the first mantra 
in the beginning of the Rgveda (1.1.1) and provide first the given whole and 


then its analyzed components: 


Samhita-patha (the given whole) 
agnim ile purohitam yajfiasya devam rtvijam | hotaram ratnadhatamam / 


Pada-patha (analyzed components) 
agnim | ile | purah'hitam | yajfiasya | devam | rtvijam | hotaram / ratna'dhatamam | 


Pratisakhya texts provide rules for combination of padas in order to re- 
gain the continuous recitation or Samhità-patha. The question as to what 
comes first—whether Pada-patha has its origins in Samhita-patha or vice 
versa—is discussed by the commentators. The assertion Ha Was: in 
(RVPr. 2.1) can be understood in two different ways: samhita whose prakrti 
(origin) is pada or samhita, which is the prakrti (origin) of pada. Commenting 
on this, Uvata says that: 
samhita, whose constituents are padas, is here a modification of the constituting padas. 


For example, the modifications satva or natva occur in samhita only. Because they are 
the constituents, therefore, padas are established original forms.” 


Yaska in his Nirukta also states that “samhita is the one having padas as its con- 
stituent and all the branches of the Veda consider it to be so."?* Commenting 


P There are, however, some exceptions, for example, the fifth chapter of the Vā- 
jasaneyipratisakhya which gives some rules to identify the padas within the Samhita-patha. 
See (Varma 1987 p. 335-363). 

? Vajasaneyipratisakhya defines a pada as that component, which is “capable of possess- 
ing an independent sense". eni: Wa (VPr. 3.2). Yaska in Nirukta mentions four kinds of 
padas : nouns (naman), verbs (akhyata), prepositions (upasarga) and indeclinables (nipata). 
rer. Yeates TARTS Aaaa AAA safe! (N. 1.1). K. V. Abhyankar provides a 
compendium on Pada-patha and specifies pada (meaningful unit), their compact expres- 
sion (vrtti) and their types, vigraha or analysis of the conjoined expressions, resulting nodes 
or breaks (parvan), stops (avagraha), accentuation (svara) and other special features of Pada- 
patha in (Abhyankar 1974). 

21 On this process, see (Cardona 1967a p. 313-352). 

?? The transliteration I use is the one generally adopted in western scholarship. I have not 
added accent marks here. George Cardona (1997 p. li-Ixiv) discusses the different translit- 
eration issues. Peter M. Scharf and Malcom D. Hyman (2012) provide a detailed study of 
various issues involved in encoding Sanskrit in computers. 

23 darfst Spier Bea: Sears: A Wesen: tera fear: | wem fe Tera fear: Pears un 
Hated | sien qaem Talat ects | Uvata on (RVPr. 2.1). 

24 qap: Seal | WM aoa mia (N. 1.17). 
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upon this, Durgacarya takes up the question in a detailed manner and puts 
forward two possible cases: 


1. "That, which is the cause of padas, that (samhita) is pada-prakrti. Why? Be- 
cause padas are formed out of samhita. Therefore, some consider samhita to 


be the original form (prakrti) and padas to be their modifications (vikara)."?* 


2. "Others, however, understand the statement padaprakrtih samhita to be 
samhita, whose cause are the padas. Why? Because samhitd is gained out 
of the combinations of padas only. Therefore, padas are the original form 
and samhita is their modification."?6 


He further raises the question, which option is better: to consider padas to be 
the original form and the samhita to be their modification or vice versa and 
decides for the latter giving several justifications based on the earlier usage 
of Samhita-patha." It follows that the Pratióakhya texts perform the task 
of analysis of samhita-patha into pada-patha and then provide a rule based 
process to combine the padas to regain the samhita-patha. 


To sum up, we can identify the processes of analysis of a given whole 
into its components, and complementary to it the synthesis from components 
to the given whole in the examples taken from different texts. In particular 
we noticed the following instances: 


1. Sanskrit expression into syllables (aksara) : The example from the 
Pratisakhya shows that both the process of analysis of a Sanskrit utterance 
into syllables as well as the reverse process of synthesis are rule based. 


2. In case of a ritual formation like the syenacit the process of analysis into 
components like the caturthi bricks is teleological. The reverse process of 
synthesis on the other hand is rule based. 


3. The analysis of samhita-patha into the pada-patha is again teleological, while 
synthesis is rule based. 


4. The process of analysis of Sanskrit expressions into components like the 
roots (prakrti) and suffixes (pratyaya) is again teleological, while synthesis 
is rule based. 


We now look into the constituents that are gained by further analyzing the 
components. 


Na NEN 


25 qae qt ate: Sr Tape: | Fas sere? feet R aT fed | THATS septi: Ware aA 
Fed | Durgacarya on (N. 1.17). 


26 IR F: Tarp: vafer verf pfe: aa ARA fi arom? wareds fe deem dear 
wala | aera Galea fe pA: RAR Durgacarya on (N. 1.17). 

7 ore Fe gener mia: Wait splat ifr frorcerga at france Talat pled eec afe? gd 
gka: spa sara: | ome È SRO? gud | Ha aaa: Gass: efeddariersqd a ge: | 
Durgacarya on (N. 1.17). 


A.2 Components and units 139 


A.2 Components and units 


The process of analysis is carried further in the case of linguistic as well 
as geometrical components. The components are further sub-divided into 
fundamental units. 


The linguistic components are analyzed into the basic sound units or 
the phonemes. For example, 

Components: tat / sa | vi / tur [ va | re | nyam | 

Units: t/a/t/slalvliltlulrlolalrlelnlylalml 


As in the case of components, the units are dependent upon the given 
phenomena. For example, the set of phonemes is dependent upon the type 
of sounds comprising a particular language. For some other language, this 
set may be different)? But, unlike the components, units are unique and 
fundamental. Uniqueness implies that there cannot be two different sets of 
units corresponding to the same phenomenon. And fundamental means that 
they can not be analyzed further. 


The geometrical components are also further dissected into more basic 
units, namely points and lines. For example, the padyd brick is analyzed as 


follows: A FN 


The set of units in this case would consist of point, line, surface etc. It should 
be noted, that as in the case of phonemes where, for example an element 
of the set represents a type that can have several instances, having different 
length and accents, similarly here, the elements stand for the fundamental 
types which may have different length or area etc. Thus, in this case as well, 
the set of units is fundamental and unique. 


We have identified two different categories of constituents of a given 
phenomena. 


1. The first category is what we name as components. Examples are pada, 
prakrti, pratyaya, aksara, siras, puccha, caturtht-bricks. 


2. The second category is what we define as the units. The examples of 
units are phonemes for linguistic-components and lines and points for 
geometrical-components. 


28 For example, certain sounds like /h, jihvamültrya, upadhmaniya are a part of the Vedic 
expressions but not of classical Sanskrit. 
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To elucidate the difference between the components and units, we note that 
the units are fundamental constituents that cannot be divided further. What 
is meant here is that if one were to analyze them further, the same type would 
result. They are physical entities with some audible/ visible form. In the case 
of linguistic units, the form would be the sound which one hears, and for 
geometrical units it is the visible shape. Further, we stipulate that a unit is 
characterized only through its form. In order to specify this, we need first to 
look into the process of characterization of units. Characterization in general 
is the process of identifying any object on the basis of certain features that 
help in differentiating it from dissimilar objects, and relating it with similar 
ones. The characteristics which identify units are related to their physical 
form. We call them features. 


Let us look into the definitions in the Rgvedapratisakhya where the 
fundamental units are defined. These units, in the case of linguistic phenom- 
ena are the phonemes. The manner in which the characterizing features are 
assigned to them is by direct specification of the type: "the phoneme x has 
the feature f". If more than one phoneme shares the same feature then they 
are referred to collectively. In order to refer to them, a list is provided at the 
beginning. The rules of the Rgvedapratisakhya refer to such a list. As an 
example, consider the very first rule: astau samanaksaranyaditah (RVPr. 1.1) 
[Eight are samanaksara in the beginning]. This statement attaches the feature 
samanaksara (monopthongs) to the phonemes: { a, à, r, f, i, 1, u, ü }. Here, 
the relevant phonemes are put in a group. The name of the group is then the 
feature which is attached to the individual unit belonging to that set. In a 
similar manner, other features like sandhyaksara (dipthongs), svara (vowels), 
vyanjana (consonants), sparsa, varga (the five groups of five each), antahstha 
(semi-vowels), üsman (spirants), aghosa (sonant, voiceless), sosman (aspirates), 
kanthya (velar) etc. are assigned to the respective phonemes.” 


Here, one must note the difference between definition of some sound 
and its specification within some linguistic expression. Consider the example 
of nasal sound (anunasika). It is defined in that it is uttered in the list of 
sounds and later named or referred to as anunasika.?? In the Astadhyayr, this 
defining characteristic about its phonetic form is mentioned as that utterance 
which is spoken simultaneously with mouth and nose.*! Its specification, on 
the other hand, is the determination that in a particular expression, some 
specific sound happens to be anunasika. 


29 See (RVPr. 1.1-18). 
30 AJMARA: (RVPr. 1.14) [The last phoneme of each of the 5 groups is anundsika]. 
31 PAARA IAE: !2.2.€1l » from mouth and nose uttered sound is anundsika. 


A.2 Components and units 141 


A.2.1 Characterization 


Let us now look at the characterization of components. A component is 
composed out of one or more units. In the case of linguistic components, 
this combination is a sequential concatenation of the individual sound units. 
Characterization of its physical/phonetic form, therefore, is in terms of the 
features of the constituent units. There are, however, other characteristics that 
are assigned to components. We call them attributes in order to distinguish 
them from the features. While features correspond only to those properties 
that refer to the physical form of a unit, attributes refer to the following 
additional aspects. 


There are attributes whose assignment involves conditions which de- 
pend upon the distribution of phonemes within an expression. One such 
attribute is guru (heavy syllable). Apart from syllables having dirgha (long) 
vowel, the other ones, namely the one having hrasva (short) vowel, are also 
guru if samyoga (consonant cluster) or anusoüra (nasal sound) follows.? 
There are attributes which depend upon the distribution of components, 
for example, the term anga which is attached to that part which is before 
the suffix for which some operation is specified.? The most common condi- 
tion for attributes, however, is on the basis of their belonging to a particular 
set. For example, the group of suffixes (k)ta and (k)tavat(u) are called nistha.*4 


The question that may be asked here is, why do we need these attributes? One 
requires attributes to formulate the rules of synthesis. It is very common that 
commentators mention the instances of rules where the attributes are used. 
For example, in the very first rule of the Rgvedapratisakhya, the purpose of 
coining the attribute samanaksara is mentioned to be its use in formulation of 
a subsequent rule.? Thus, the main justification for specifying an attribute is 
its use in the formulation of the system. We would not need attributes if we 
do not want to formulate our rules in terms of them. Attributes, therefore, 
contain and represent grammatical information. They encode information 
which originates from the system. 


32 p für areata) siat ean Tea Rot anis (RVPr. 1.20-21). 

35 qenrercaafafadanfa ScTdsqu 12.9.13 > that part which enjoins a pratyaya based opera- 
tion, before that the sequence is anga. 

34 spei NSI 9.2.38 > (k)ta and (k)tavat(u) are nistha. 

35 GAARA: Tas FATAL Gates Ze! Uvata on (RVPr. 1.1). The purpose of the term 
samanaksara is its use in the rule QATAR WAÀ (RVPr. 2.15). 
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A.2.2 Combination of units and components 


In section A.1 we noted that the components are combined in a rule based 
manner which results in the synthesis of the original given whole. Thus, the 
syllables are combined to form the original linguistic expression. The padas 
of the Pada-patha are combined following the rules of the corresponding 
Pratisakhya to regain the original Samhita-patha. The bricks of the syenacit 
like the caturthi, ardha, padya etc. are combined to form the head (siras), the 
main core (atman) etc. which again are combined to form the original $yenacit. 
The components like roots (prakrti) and suffixes (pratyaya) are combined to 
form expressions of standard speech. 


A.2.3 Constructs 


Next we consider combinations of units in order to form components. At this 
stage, we need to determine an important category of elements, which we 
term as the constructs. In order to introduce the basic idea about constructs, 
let us first consider an example from the S$ulbasütra. We have seen that a 
given phenomenon, like the syenacit, is analyzed into components like head 
(Siras) and the main core (atman), which are further dissected into components 
like the caturthi-bricks. Further, there are fundamental units, that are gained 
by analyzing the components. The question which can now be asked is: 
what can be constructed by combining the fundamental units in a rule based 
manner? As will soon become clear, we need to introduce a new category to 
represent the results of application of a rule-based procedure of combinations 
of the fundamental units. We call them constructs. For example, if we apply 
the rules mentioned in (BSuS. 1.3-4) then we can form a square (caturasra). 


The difference between a component and a construct is that while the 
former is a tangible part of the given phenomenon, the latter is an abstraction 
of it resulting out of rule-based combinations of the units. The passage from 
the square-shaped caturthi-bricks to the concept of square, and consequently 
supplying rules for its formation is the significant step, which according to 
Axel Michaels (1978 p. 17-20), indicates the origin of the science of geometry. 
In our case, constructs represent not only the concept of a square, but equally 
important is the condition that they are constructed out of rule-based combi- 
nations of the fundamental units. Hence the choice of the term. It needs to be 
clarified here, that the constructs are also tangible and physical entities and 
represent the generalized concept of that entity and not its instantiation in 
the world of phenomena. 
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Another example is construction of syllables from the set of phonemes. 
The rules mentioned on page 132 specify this. Here again, syllables represent 
the conceptual constructs that are generated through rule-based combina- 
tions of phonemes. It should be noted that not all the syllables that can 
be theoretically constructed are instantiated as actual syllables belonging 
to some linguistic expression (aksaras). Further, syllables as constructs are 
also physical entities. Syllables as components, however, are constituents of 
a given linguistic expression. A phoneme sequence like bmha, although a 
syllabic construct, is not a syllabic component of the Sanskrit language.*° 


Constructs are abstractions of the components that are generated through a 
rule-based combination of units. Those constructs that are also constituents 
of a given phenomenon are instantiated as components of that phenomenon. 
This distinction between the constructs and components, as we shall 
see, is important to differentiate between the process of synthesis and the 
possibilities for formalization. 


A.3 Structures, variations and change 


The application of the systematic approach on a set of phenomena results in 
the evolution of a comprehending structure. The nature of such a structure 
is an interconnected network of components on the basis of their characteris- 
tics. The rules of synthesis specify this network or interconnection. 


Thus, the rules of the Pratisakhyas provide for the possible connections 
and combinations of the components (i.e. the padas) and the units (or the 
phonemes). Similarly, the rules of the Sulbasütras specify the combinations 
of the components like the caturthi bricks as well as units or lines and points. 
In the case of grammar, again the rules provide for the interconnections of 
the components, i.e. which component can be combined with which other. 


The systematic approach of analysis and synthesis not only provides a 
mechanism to retain a given phenomenon, but also facilitates the recording of 
variations and brings about change. There are two ways in which variation is 
comprehended. Firstly, while formulating the rules, provision for alternatives 
is provided. Panini, for example, uses the expressions anyatarasyam, vibhasa 
and vi to accommodate the variations (Kiparsky 1980). Secondly, there are 
additional conditions which specify the varying contexts. In the case of the 
Astadhyayi, some of these contextual variations are those corresponding to 


36 Ulrich Stiehl provides a list of attested syllabic components. See http://www. 
sanskritweb.de/ (accessed on 09.07.2012). 
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the usage in the Vedic literature? or geographical? or even the opinions of 
others”. 


The existence of a system brings about consolidation of change by in- 
corporating the new or not yet comprehended phenomena within the 
descriptive structure. This is facilitated because the new phenomena partially 
contains components that are already collected in the structure and partially 
extra fresh components are to be conceived. Changes can thus be integrated 
by minor adjustments in the structure. 


7 Eg. fram Seah 12.3.38 lw In recitation of Veda (chandas), ekasruti is optional (vibhasa). 
38 E.g. US Wat Gai 2.2.94 ll > eri in case of expressions referring to the eastern region. 

39 E.g. HFA ACHAT 12.2.2 > according to Sakalya o(t) is termed pragrhya if it is 
sambuddhi and when iti that is not Vedic (arsa) follows. 
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The present work is a study of the Astadhyayr of Panini 
from a new perspective. It attempts to explore the 
Paninian system of Sanskrit grammar from a formal 
point of view and investigate the possibilities of rep- 
resenting it in a logical, explicit and consistent man- 
ner. It puts forward an appropriate framework for 
such a representation. Differing from the formulation 
of Astadhyayr, which is composed in an artificial yet 
natural language and is meant to be employed by in- 
dividuals who are acquainted both with the Sanskrit 
language and the techniques of grammar, the pres- 
ent rendering aims for a non-verbal representation in 
terms of mathematical categories and logical relations 
which can be implemented in an algorithmic manner. 
The formal framework suggested in this work would 
facilitate adequate tools for postulating and evaluating 
hypotheses about the grammatical system. Moreover, 
it would furnish the basis for a computer implemen- 
tation of the grammar. Both these aspects are objects 
of enquiry in the field of theoretical studies on Panini 
as well as the emerging discipline of Sanskrit computa- 
tional linguistics. This book takes on the ground-work 
in these areas. 
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